1 Introduction

This article, which is an accompanying paper to [BLS09], consists of two parts. In section 2
we present a version of Fenchel's perturbation method for the duality theory of the
Monge-Kantorovich problem of optimal transport. The treatment is elementary as we suppose that
the spaces $(X, \mu)$, $(Y, \nu)$, on which the optimal transport problem [Vil03, Vil09] is defined,
simply equal the finite set $\{1, \dots, N\}$ equipped with uniform measure. In this setting the
optimal transport problem reduces to a finite-dimensional linear programming problem.

The purpose of this first part of the paper is rather didactic: it should stress some
features of the linear programming nature of the optimal transport problem, which carry
over also to the case of general Polish spaces $X$, $Y$ equipped with Borel probability measures
$\mu$, $\nu$, and general Borel measurable cost functions $c : X \times Y \to [0, \infty]$. This general setting is
analyzed in detail in [BLS09]; section 2 below may serve as a motivation for the arguments
in the proof of Theorems 1.2 and 1.7 of [BLS09] which pertain to the general duality theory.

The second, and longer, part of the paper, consisting of sections 3 and 4, is of a quite
different nature.

Section 3 is devoted to illustrating a technical feature of [BLS09, Theorem 4.2] by an
explicit example. The technical feature is the appearance of the singular part $\hat h^s$ of the dual
optimizer $\hat h \in L^1(X \times Y, \pi)^{**}$ obtained in [BLS09, Theorem 4.2]. In Example 3.1 below we
show that, in general, the dual optimizer $\hat h$ does indeed contain a non-trivial singular part.
In addition, this example allows us to observe in a rather explicit way how this singular part
"builds up" for an optimizing sequence $(\varphi_n \oplus \psi_n)_{n=1}^\infty \in L^1(X \times Y, \pi)$ which converges to $\hat h$
with respect to the weak-star topology. The construction of this example, which is a variant
of an example due to L. Ambrosio and A. Pratelli [AP03], is rather long and technical.
Some motivation for this construction will be given at the end of Section 2.

Section 4 pertains to a modified version of the duality relation in the Monge-Kantorovich
transport problem. Trivial counterexamples such as [BLS09, Example 1.1] show that in the
case of a measurable cost function $c : X \times Y \to [0, \infty]$ there may be a duality gap. The main
result (Theorem 1.2) of [BLS09] asserts that one may avoid this difficulty by considering a
suitable relaxed form of the primal problem; if one does so, duality holds true in complete
generality. In a different vein, one may leave the primal problem unchanged, and overcome
the difficulties encountered in the above-mentioned simple example by considering a slightly
modified dual problem (cf. [BLS09, Remark 3.4]). In the last part of the article we consider
a certain twist of the construction given in section 3, which allows us to prove that this dual
relaxation does not lead to a general duality result.



2 The finite case 

In this section we present the duality theory of optimal transport for the finite case: Let
$X = Y = \{1, \dots, N\}$ and let $\mu = \nu$ assign probability $N^{-1}$ to each of the points $1, \dots, N$.
Let $c = (c(i,j))_{i,j=1}^N$ be an $\mathbb{R}_+$-valued $N \times N$ matrix.

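The problem of optimal transport then becomes the following linear optimization problem

$$\langle c, \pi \rangle := \sum_{i=1}^N \sum_{j=1}^N \pi(i,j)\, c(i,j) \to \min, \qquad \pi \in \mathbb{R}^{N^2}, \tag{1}$$

under the constraints

$$\sum_{j=1}^N \pi(i,j) = N^{-1}, \qquad i = 1, \dots, N,$$

$$\sum_{i=1}^N \pi(i,j) = N^{-1}, \qquad j = 1, \dots, N,$$

$$\pi(i,j) \ge 0, \qquad i,j = 1, \dots, N.$$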

Of course, this is an easy and standard problem of linear optimization; yet we want 
to treat it in some detail in order to develop intuition and concepts for the general case 
considered in [BLS09] as well as in section 3.
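For small $N$ the primal problem (1) can be solved by hand or by brute force. The following Python sketch is an illustration only and not part of the original argument. It relies on an assumption not stated in the text, namely the Birkhoff-von Neumann theorem, by which the minimum over couplings of two uniform measures is attained at $N^{-1}$ times a permutation matrix. The function name `primal_value` and the cost matrix are hypothetical.

```python
from fractions import Fraction
from itertools import permutations


def primal_value(c):
    """Return (P, sigma) for an N x N cost matrix c with uniform marginals.

    Assumption (not from the text): by the Birkhoff-von Neumann theorem it
    suffices to search over plans pi = (1/N) * permutation matrix.
    Enumeration is only sensible for small N.
    """
    n = len(c)
    best_cost, best_perm = None, None
    for sigma in permutations(range(n)):
        # Cost <c, pi> of the plan sending mass 1/N from i to sigma[i].
        cost = Fraction(sum(c[i][sigma[i]] for i in range(n)), n)
        if best_cost is None or cost < best_cost:
            best_cost, best_perm = cost, sigma
    return best_cost, best_perm


# Hypothetical 3 x 3 cost matrix, used only for illustration.
c = [[4, 1, 3],
     [2, 0, 5],
     [3, 2, 2]]
P, sigma = primal_value(c)
```

Here `P` is the optimal value of (1) for this particular matrix and `sigma` encodes an optimal plan with zero-based indices.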

For the two sets of equality constraints we introduce $2N$ Lagrange multipliers $(\varphi(i))_{i=1}^N$
and $(\psi(j))_{j=1}^N$ taking values in $\mathbb{R}$, and for the inequality constraints $\pi(i,j) \ge 0$ we introduce
Lagrange multipliers $(\rho_{ij})_{i,j=1}^N$ taking values in $\mathbb{R}_+$. The Lagrangian functional $L(\pi, \varphi, \psi, \rho)$
is then given by

$$L(\pi, \varphi, \psi, \rho) = \sum_{i=1}^N \sum_{j=1}^N c(i,j)\, \pi(i,j)
 - \sum_{i=1}^N \varphi(i) \Big( \sum_{j=1}^N \pi(i,j) - N^{-1} \Big)
 - \sum_{j=1}^N \psi(j) \Big( \sum_{i=1}^N \pi(i,j) - N^{-1} \Big)
 - \sum_{i=1}^N \sum_{j=1}^N \rho_{ij}\, \pi(i,j),$$

where the $\varphi(i)$ and $\psi(j)$ range in $\mathbb{R}$, while the $\rho_{ij}$ range in $\mathbb{R}_+$.

It is designed in such a way that

$$C(\pi) := \sup_{\varphi, \psi, \rho} L(\pi, \varphi, \psi, \rho) = \langle c, \pi \rangle + \chi_{\Pi(\mu,\nu)}(\pi),$$

where $\Pi(\mu, \nu)$ denotes the admissible set of $\pi$'s, i.e., the probability measures on $X \times Y$
with marginals $\mu$ and $\nu$, and $\chi_A(\cdot)$ denotes the indicator function of a set $A$ in the sense
of convex function theory, i.e., taking the value $0$ on $A$, and the value $+\infty$ outside of $A$.
In particular, we have

$$P := \inf_{\pi \in \mathbb{R}^{N^2}} C(\pi) = \inf_{\pi \in \mathbb{R}^{N^2}} \sup_{\varphi, \psi, \rho} L(\pi, \varphi, \psi, \rho),$$

where $P$ is the optimal value of the primal optimization problem (1).

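To develop the duality theory of the primal problem (1) we pass from $\inf \sup L$ to
$\sup \inf L$. Denote by $D(\varphi, \psi, \rho)$ the dual function

$$D(\varphi, \psi, \rho) = \inf_{\pi \in \mathbb{R}^{N^2}} L(\pi, \varphi, \psi, \rho)
 = \inf_{\pi \in \mathbb{R}^{N^2}} \sum_{i=1}^N \sum_{j=1}^N \pi(i,j) \big[ c(i,j) - \varphi(i) - \psi(j) - \rho_{ij} \big]
 + N^{-1} \Big[ \sum_{i=1}^N \varphi(i) + \sum_{j=1}^N \psi(j) \Big]. \tag{2}$$

Hence we obtain as the optimal value of the dual problem

$$D := \sup_{\varphi, \psi, \rho} D(\varphi, \psi, \rho) = \sup_{\varphi, \psi, \rho} \Big[ \big( \mathbb{E}_\mu[\varphi] + \mathbb{E}_\nu[\psi] \big) - \chi_\Psi(\varphi, \psi, \rho) \Big],$$

where $\Psi$ denotes the admissible set of $(\varphi, \psi, \rho)$, i.e. those satisfying

$$\varphi(i) + \psi(j) + \rho_{ij} = c(i,j), \qquad 1 \le i,j \le N,$$

for some non-negative "slack variables" $\rho_{ij}$.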

Let us show that there is no duality gap, i.e., the values of P and D coincide. Of course, in 
the present finite dimensional case, this equality as well as the fact that the inf sup (resp. sup 
inf) above is a min max (resp. a max min) easily follows from general compactness arguments. 
Yet we want to verify things directly using the idea of "complementary slackness" of the 
primal and the dual constraints (good references are, e.g. [PSU88, ET99, AE06]). 

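We apply "Fenchel's perturbation map" to explicitly show the equality $P = D$. Let

$$T : \mathbb{R}^{N^2} \to \mathbb{R}^N \times \mathbb{R}^N$$

be the linear map defined as

$$T\big( (\pi(i,j))_{1 \le i,j \le N} \big) = \Big( \Big( \sum_{j=1}^N \pi(i,j) \Big)_{i=1}^N, \Big( \sum_{i=1}^N \pi(i,j) \Big)_{j=1}^N \Big),$$

so that the problem (1) now can be phrased as

$$\langle c, \pi \rangle = \sum_{i=1}^N \sum_{j=1}^N c(i,j)\, \pi(i,j) \to \min, \qquad \pi \in \mathbb{R}_+^{N^2},$$

under the constraint

$$T(\pi) = \big( (N^{-1}, \dots, N^{-1}), (N^{-1}, \dots, N^{-1}) \big).$$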



The range of the linear map $T$ is the subspace $E \subseteq \mathbb{R}^N \times \mathbb{R}^N$, of codimension 1, formed by
the pairs $(f, g)$ such that $\sum_{i=1}^N f(i) = \sum_{j=1}^N g(j)$, in other words $\mathbb{E}_\mu[f] = \mathbb{E}_\nu[g]$. We consider $T$
as a map from $\mathbb{R}^{N^2}$ to $E$ and denote by $E_+$ the positive orthant of $E$.
Let $\Phi : E_+ \to [0, \infty]$ be the map

$$\Phi(f, g) = \inf \big\{ \langle c, \pi \rangle : \pi \in \mathbb{R}_+^{N^2},\ T(\pi) = (f, g) \big\}.$$

We shall verify explicitly that $\Phi$ is an $\mathbb{R}_+$-valued, convex, lower semi-continuous, positively
homogeneous map on $E_+$.

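The finiteness and positivity of $\Phi$ follow from the fact that, for $(f, g) \in E_+$, the set of
$\pi \in \mathbb{R}_+^{N^2}$ with $T(\pi) = (f, g)$ is non-empty and from the non-negativity of $c$. As regards
the convexity of $\Phi$, let $(f_1, g_1), (f_2, g_2) \in E_+$ and, for given $\varepsilon > 0$, find $\pi_1, \pi_2 \in \mathbb{R}_+^{N^2}$ such that $T(\pi_1) =
(f_1, g_1)$, $T(\pi_2) = (f_2, g_2)$ and $\langle c, \pi_1 \rangle < \Phi(f_1, g_1) + \varepsilon$ as well as $\langle c, \pi_2 \rangle < \Phi(f_2, g_2) + \varepsilon$. Then

$$\Phi\Big( \frac{(f_1, g_1) + (f_2, g_2)}{2} \Big) \le \Big\langle c, \frac{\pi_1 + \pi_2}{2} \Big\rangle < \frac{\Phi(f_1, g_1) + \Phi(f_2, g_2)}{2} + \varepsilon,$$

which proves the convexity of $\Phi$.

If $((f_n, g_n))_{n=1}^\infty \in E_+$ converges to $(f, g)$, find $(\pi_n)_{n=1}^\infty$ in $\mathbb{R}_+^{N^2}$ such that $T(\pi_n) = (f_n, g_n)$
and $\langle c, \pi_n \rangle < \Phi(f_n, g_n) + n^{-1}$. Note that $(\pi_n)_{n=1}^\infty$ is bounded in $\mathbb{R}_+^{N^2}$, so that there is a
subsequence $(\pi_{n_k})_{k=1}^\infty$ converging to some $\pi \in \mathbb{R}_+^{N^2}$ with $T(\pi) = (f, g)$. Hence
$\Phi(f, g) \le \langle c, \pi \rangle = \lim_{k} \langle c, \pi_{n_k} \rangle \le \liminf_{k} \Phi(f_{n_k}, g_{n_k})$, showing the lower
semi-continuity of $\Phi$. Finally note that $\Phi$ is positively homogeneous, i.e., $\Phi(\lambda f, \lambda g) = \lambda \Phi(f, g)$,
for $\lambda \ge 0$.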

The point $(f_0, g_0)$ with $f_0 = g_0 = (N^{-1}, \dots, N^{-1})$ is in $E_+$ and $\Phi$ is bounded in a
neighbourhood $V$ of $(f_0, g_0)$. Indeed, fixing any $0 < a < N^{-1}$, the following set $V$ does the
job:

$$V = \big\{ (f, g) \in E : |f(i) - N^{-1}| < a,\ |g(j) - N^{-1}| < a, \text{ for } 1 \le i,j \le N \big\}.$$

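The boundedness of the lower semi-continuous convex function $\Phi$ on $V$ implies that the
subdifferential of $\Phi$ at $(f_0, g_0)$ is non-empty. Considering $\Phi$ as a function on $\mathbb{R}^{2N}$ (by
defining it to equal $+\infty$ on $\mathbb{R}^{2N} \setminus E_+$) we may find an element $(\hat\varphi, \hat\psi) \in \mathbb{R}^N \times \mathbb{R}^N$ in this
subdifferential. By the positive homogeneity of $\Phi$ we have

$$\Phi(f, g) \ge \langle (\hat\varphi, \hat\psi), (f, g) \rangle = \langle \hat\varphi, f \rangle + \langle \hat\psi, g \rangle, \qquad \text{for } (f, g) \in \mathbb{R}^N \times \mathbb{R}^N,$$

and

$$P = \Phi(f_0, g_0) = \langle \hat\varphi, f_0 \rangle + \langle \hat\psi, g_0 \rangle.$$

By the definition of $\Phi$ we therefore have, for each $\pi \in \mathbb{R}_+^{N^2}$,

$$\langle c, \pi \rangle \ge \inf_{\tilde\pi \in \mathbb{R}_+^{N^2}} \big\{ \langle c, \tilde\pi \rangle : T(\tilde\pi) = T(\pi) \big\}
 = \Phi(T(\pi))
 \ge \langle T(\pi), (\hat\varphi, \hat\psi) \rangle
 = \sum_{i=1}^N \sum_{j=1}^N \pi(i,j) \big( \hat\varphi(i) + \hat\psi(j) \big),$$

so that

$$c(i,j) \ge \hat\varphi(i) + \hat\psi(j), \qquad \text{for } 1 \le i,j \le N. \tag{3}$$

By compactness, there is $\hat\pi \in \Pi(\mu, \nu)$, i.e., there is an element $\hat\pi \in \mathbb{R}_+^{N^2}$ verifying $T(\hat\pi) =
(f_0, g_0)$, such that

$$\langle c, \hat\pi \rangle = \langle \hat\varphi + \hat\psi, \hat\pi \rangle. \tag{4}$$

Summing up, we have shown that $\hat\pi$ and $(\hat\varphi, \hat\psi)$ are primal and dual optimizers and that
the value of the primal problem equals the value of the dual problem, namely $\langle \hat\varphi + \hat\psi, \hat\pi \rangle$.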

To finish this elementary treatment of the finite case, let us consider the case when we
allow the cost function $c$ to take values in $[0, \infty]$ rather than in $[0, \infty[$. In this case the
primal problem simply loses some dimensions: for the $(i,j)$'s where $c(i,j) = \infty$ we must
have $\pi(i,j) = 0$, so that we consider

$$\langle c, \pi \rangle := \sum_{i=1}^N \sum_{j=1}^N \pi(i,j)\, c(i,j) \to \min,$$

where we now optimize over $\pi \in \mathbb{R}_+^{N^2}$ with $\pi(i,j) = 0$ if $c(i,j) = \infty$. For the problem to
make sense we clearly must have that there is at least one $\pi \in \Pi(\mu, \nu)$ with $\langle c, \pi \rangle < \infty$. If this
non-triviality condition is satisfied, the above arguments carry over without any non-trivial
modification.

We now analyze explicitly the well-known "complementary slackness conditions" and
interpret them in the present context. For a pair $\hat\pi$ and $(\hat\varphi, \hat\psi)$ of primal and dual optimizers
we have

$$c(i,j) > \hat\varphi(i) + \hat\psi(j) \ \Rightarrow\ \hat\pi(i,j) = 0,$$

and

$$\hat\pi(i,j) > 0 \ \Rightarrow\ c(i,j) = \hat\varphi(i) + \hat\psi(j).$$

Indeed, these relations follow from the admissibility condition $c \ge \hat\varphi + \hat\psi$ and the duality
relation $\langle \hat\pi, c - (\hat\varphi + \hat\psi) \rangle = 0$.
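The complementary slackness relations can be checked numerically on a small instance. The sketch below is illustrative only: the cost matrix, the plan `pi` and the hand-picked potentials `phi`, `psi` are assumptions chosen for this example, and the helper name `slackness_report` is hypothetical. It tests admissibility $c \ge \varphi + \psi$, compares the primal and dual values, and evaluates the pairing $\langle \pi, c - (\varphi + \psi) \rangle$.

```python
from fractions import Fraction


def slackness_report(c, pi, phi, psi):
    """Return (feasible, primal, dual, gap) for uniform marginals on {0..n-1}."""
    n = len(c)
    idx = [(i, j) for i in range(n) for j in range(n)]
    # Admissibility of the dual pair: phi(i) + psi(j) <= c(i, j) everywhere.
    feasible = all(phi[i] + psi[j] <= c[i][j] for i, j in idx)
    # Primal value <c, pi>.
    primal = sum(pi[i][j] * c[i][j] for i, j in idx)
    # Dual value E_mu[phi] + E_nu[psi] under the uniform measures.
    dual = Fraction(sum(phi), n) + Fraction(sum(psi), n)
    # Pairing <pi, c - (phi + psi)>; it vanishes for an optimal pair.
    gap = sum(pi[i][j] * (c[i][j] - phi[i] - psi[j]) for i, j in idx)
    return feasible, primal, dual, gap


# Hypothetical data: cost matrix, a plan supported on a permutation,
# and hand-picked potentials (all chosen for illustration).
c = [[4, 1, 3],
     [2, 0, 5],
     [3, 2, 2]]
third = Fraction(1, 3)
pi = [[0, third, 0],
      [third, 0, 0],
      [0, 0, third]]
phi = [1, 0, 0]
psi = [2, 0, 2]
feasible, primal, dual, gap = slackness_report(c, pi, phi, psi)
```

In this instance the pair is admissible and the pairing vanishes, so the plan and the potentials certify each other's optimality in the sense of (3) and (4).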

This motivates the following definitions in the theory of optimal transport (see, e.g., 
[RR96] for (a) and [ST08] for (b).) 

Definition 2.1. Let $X = Y = \{1, \dots, N\}$ and $\mu = \nu$ the uniform distribution on $X$ and $Y$
respectively, and let $c : X \times Y \to \mathbb{R}_+$ be given.

(a) A subset $\Gamma \subseteq X \times Y$ is called "cyclically c-monotone" if, for $(i_1, j_1), \dots, (i_n, j_n) \in \Gamma$, we
have

$$\sum_{k=1}^n c(i_k, j_k) \le \sum_{k=1}^n c(i_k, j_{k+1}), \tag{5}$$

where $j_{n+1} = j_1$.

(b) A subset $\Gamma \subseteq X \times Y$ is called "strongly cyclically c-monotone" if there are functions
$\varphi, \psi$ such that $\varphi(i) + \psi(j) \le c(i,j)$, for all $(i,j) \in X \times Y$, with equality holding true
for $(i,j) \in \Gamma$.

In the present finite setting, the following facts are rather obvious (assertion (iii) following
from the above discussion):

(i) The support of each primal optimizer $\hat\pi$ is cyclically c-monotone.

(ii) Every $\pi \in \Pi(\mu, \nu)$ which is supported by a cyclically c-monotone set $\Gamma$ is a primal
optimizer.

(iii) A set $\Gamma \subseteq X \times Y$ is cyclically c-monotone iff it is strongly cyclically c-monotone.
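Condition (5) can be tested by brute force for a small finite set. The following sketch is an illustration under stated assumptions: the function name `is_cyclically_monotone` and the cost matrix are hypothetical, and the search is restricted to sequences of distinct points, which suffices because a sequence with a repeated point splits into two shorter cycles with the same total on both sides of (5).

```python
from itertools import permutations


def is_cyclically_monotone(c, gamma):
    """Check condition (5) for every sequence of distinct points of gamma."""
    points = list(gamma)
    for n in range(2, len(points) + 1):
        for seq in permutations(points, n):
            # Left side of (5): cost along the pairs (i_k, j_k) themselves.
            lhs = sum(c[i][j] for i, j in seq)
            # Right side of (5): each i_k is paired with j_{k+1}, cyclically.
            rhs = sum(c[seq[k][0]][seq[(k + 1) % n][1]] for k in range(n))
            if lhs > rhs:
                return False
    return True


# Hypothetical cost matrix with zero-based indices.
c = [[4, 1, 3],
     [2, 0, 5],
     [3, 2, 2]]
support_of_optimizer = {(0, 1), (1, 0), (2, 2)}
diagonal = {(0, 0), (1, 1), (2, 2)}
optimal_support_ok = is_cyclically_monotone(c, support_of_optimizer)
diagonal_ok = is_cyclically_monotone(c, diagonal)
```

For this matrix the diagonal fails the test, since exchanging the targets of the points $(0,0)$ and $(1,1)$ lowers the cost, in line with fact (i).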






In general, one may ask, for a given Monge-Kantorovich transport optimization problem,
defined on Polish spaces $X$, $Y$, equipped with Borel probability measures $\mu$, $\nu$, and a Borel
measurable cost function $c : X \times Y \to [0, \infty]$, the following natural questions:

(P) Does there exist a primal optimizer to (1), i.e. a Borel measure $\hat\pi \in \Pi(\mu, \nu)$ with
marginals $\mu$, $\nu$, such that

$$\int_{X \times Y} c \, d\hat\pi = \inf_{\pi \in \Pi(\mu, \nu)} \int_{X \times Y} c \, d\pi =: P$$

holds true?

(D) Do there exist dual optimizers to (2), i.e. Borel functions $(\hat\varphi, \hat\psi)$ in $\Psi(\mu, \nu)$ such that

$$\int_X \hat\varphi \, d\mu + \int_Y \hat\psi \, d\nu = \sup_{(\varphi, \psi) \in \Psi(\mu, \nu)} \Big( \int_X \varphi \, d\mu + \int_Y \psi \, d\nu \Big) =: D, \tag{6}$$

where $\Psi(\mu, \nu)$ denotes the set of all pairs of $[-\infty, +\infty[$-valued integrable Borel functions
$(\varphi, \psi)$ on $X$, $Y$ such that $\varphi(x) + \psi(y) \le c(x, y)$, for all $(x, y) \in X \times Y$?

(DG) Is there a duality gap, or do we have $P = D$, as it should, morally speaking,
hold true?



These are three natural questions which arise in every convex optimization problem. In
addition, one may ask the following two questions pertaining to the special features of the
Monge-Kantorovich transport problem.

(CC) Is every cyclically c-monotone transport plan $\pi \in \Pi(\mu, \nu)$ optimal, where we call
$\pi \in \Pi(\mu, \nu)$ cyclically c-monotone if there is a Borel subset $\Gamma \subseteq X \times Y$ of full support
$\pi(\Gamma) = 1$, verifying condition (5), for any $(x_1, y_1), \dots, (x_n, y_n) \in \Gamma$?

(SCC) Is every strongly cyclically c-monotone transport plan $\pi \in \Pi(\mu, \nu)$ optimal, where
we call $\pi \in \Pi(\mu, \nu)$ strongly cyclically c-monotone if there are Borel functions $\varphi : X \to
[-\infty, +\infty[$ and $\psi : Y \to [-\infty, +\infty[$, satisfying $\varphi(x) + \psi(y) \le c(x, y)$, for all $(x, y) \in X \times Y$,
and $\pi\{\varphi + \psi = c\} = 1$?

Much effort has been made over the past decades to provide increasingly general answers
to the questions above. We mention the work of Rüschendorf [Rus96] who adapted the
notion of cyclical monotonicity from Rockafellar [Roc66]. Rockafellar's work pertains to
the case $c(x, y) = -\langle x, y \rangle$, for $x, y \in \mathbb{R}^n$, while Rüschendorf's work pertains to the present
setting of general cost functions $c$, thus arriving at the notion of cyclical c-monotonicity.
Intimately related is the notion of the c-conjugate $\varphi^c$ of a function $\varphi$.

We also mention G. Kellerer's fundamental work on the duality theory; in [Kel84] he
established that $P = D$ provided that $c : X \times Y \to [0, \infty]$ is lower semi-continuous, or merely
Borel-measurable and uniformly bounded.

The seminal paper [GM96] proves (among many other results) that we have a positive
answer to question (CC) above in the following situation: every cyclically c-monotone
transport plan is optimal provided that the cost function $c$ is continuous and $X$, $Y$ are
compact subsets of $\mathbb{R}^n$. In [Vil03, Problem 2.25] it is asked whether this extends to the case
$X = Y = \mathbb{R}^n$ with the squared Euclidean distance as cost function. This was answered
independently in [Pra08] and [ST08]: the answer to (CC) is positive for general Polish spaces $X$
and $Y$, provided that the cost function $c : X \times Y \to [0, \infty]$ is continuous ([Pra08]) or lower
semi-continuous and finitely valued ([ST08]). Indeed, in the latter case, a transport plan is
optimal if and only if it is strongly c-monotone.



Let us briefly summarize the state of the art pertaining to the five questions above.

As regards the most basic issue, namely (DG) pertaining to the question whether duality
makes sense at all, this is analyzed in detail, building on a lot of previous literature, in
section 2 of the accompanying paper [BLS09]: it is shown there that, for a properly relaxed
version of the primal problem, question (DG) has an affirmative answer in a perfectly general
setting, i.e. for arbitrary Borel-measurable cost functions $c : X \times Y \to [0, \infty]$ defined on the
product of two Polish spaces $X$, $Y$, equipped with Borel probability measures $\mu$, $\nu$.

As regards question (P) we find the following situation: if the cost function $c : X \times Y \to
[0, \infty]$ is lower semi-continuous, the answer to question (P) is always positive. Indeed, for
an optimizing sequence $(\pi_n)_{n=1}^\infty$ in $\Pi(\mu, \nu)$, one may apply Prokhorov's theorem to find a
weak limit $\hat\pi = \lim_{k \to \infty} \pi_{n_k}$. If $c$ is lower semi-continuous, we get

$$\int_{X \times Y} c \, d\hat\pi \le \lim_{k \to \infty} \int_{X \times Y} c \, d\pi_{n_k},$$

which yields the optimality of $\hat\pi$.

On the other hand, if c fails to be lower semi-continuous, there is little reason why a 
primal optimizer should exist (see, e.g., [Kel84, Example 2.20]). 

As regards (D), the question of the existence of a dual optimizer is more delicate than
for the primal case (P): it was shown in [AP03, Theorem 3.2] that, for $c : X \times Y \to \mathbb{R}_+$
satisfying a certain moment condition, one may assert the existence of integrable optimizers
$(\hat\varphi, \hat\psi)$. However, if one drops this moment condition, there is little reason why, for an
optimizing sequence $(\varphi_n, \psi_n)_{n=1}^\infty$ in (D) above, the $L^1$-norms should remain bounded. Hence
there is little reason why one should be able to find integrable optimizers $(\hat\varphi, \hat\psi)$, as shown by
easy examples (e.g. [BS09, Examples 4.4, 4.5]) arising in rather regular situations.

Yet one would like to be able to pass to some kind of limit $(\hat\varphi, \hat\psi)$, whether these functions
are integrable or not. In the case when $\hat\varphi$ and/or $\hat\psi$ fail to be integrable, special care then
has to be taken to give a proper sense to (6).

This situation was the motivation for the introduction of the notion of strong cyclical
c-monotonicity in [ST08]: this notion (see (SCC) above) characterizes the optimality of
a given $\pi \in \Pi(\mu, \nu)$ in terms of a "complementary slackness condition", involving some
$(\varphi, \psi) \in \Psi(\mu, \nu)$, playing the role of a dual optimizer $(\hat\varphi, \hat\psi)$. The crucial feature is that
we do not need any integrability of the functions $\varphi$ and $\psi$ for this notion to make sense. It
was shown in [BS09] that, also in situations where there are no integrable optimizers $(\hat\varphi, \hat\psi)$,
one may find Borel measurable functions $(\varphi, \psi)$ taking their roles in the setting of (SCC)
above.

This theme was further developed in [BS09], where it was shown that, for $\mu \otimes \nu$-a.s. finite,
Borel measurable $c : X \times Y \to [0, \infty]$, one may find Borel functions $\hat\varphi : X \to [-\infty, +\infty)$
and $\hat\psi : Y \to [-\infty, \infty)$, which are dual optimizers if we interpret (6) properly: instead of
considering

$$\int_X \hat\varphi(x) \, d\mu(x) + \int_Y \hat\psi(y) \, d\nu(y), \tag{7}$$

which needs integrability of $\hat\varphi$ and $\hat\psi$ in order to make sense, we consider

$$\int_{X \times Y} \big( \hat\varphi(x) + \hat\psi(y) \big) \, d\pi(x, y), \tag{8}$$

where the transport plan $\pi \in \Pi(\mu, \nu)$ is assumed to have finite transport cost $\int_{X \times Y} c(x, y) \, d\pi(x, y) <
\infty$. If (7) makes sense, then its value coincides with the value of (8); the crucial feature
is that (8) also makes sense in cases when (7) does not make sense any more, as shown in
[BS09, Lemma 1.1]. In particular, the value of (8) does not depend on the choice of the
transport plan $\pi \in \Pi(\mu, \nu)$, provided $\pi$ has finite transport cost $\int_{X \times Y} c(x, y) \, d\pi(x, y) < \infty$.

Summing up the preceding discussion on the existence (D) of a dual optimizer $(\hat\varphi, \hat\psi)$:
this question has a positive answer, properly interpreted, provided that the cost function
$c : X \times Y \to [0, \infty]$ is $\mu \otimes \nu$-a.s. finite ([BS09, Theorem 2]).

But things become much more complicated if we pass to cost functions $c : X \times Y \to [0, \infty]$
assuming the value $+\infty$ on possibly "large" subsets of $X \times Y$.

In [BLS09, Example 4.1] we exhibit an example, which is a variant of an example due to
L. Ambrosio and A. Pratelli [AP03, Example 3.5], of a lower semi-continuous cost function $c :
[0, 1) \times [0, 1) \to [0, \infty]$, where $(X, \mu) = (Y, \nu)$ equals $[0, 1)$ equipped with Lebesgue measure,
for which there are no Borel measurable functions $\hat\varphi, \hat\psi$ verifying $\hat\varphi(x) + \hat\psi(y) \le c(x, y)$ and
maximizing (8) above.

In this example, the cost function $c$ equals the value $+\infty$ on "many" points of $X \times Y =
[0, 1) \times [0, 1)$. In fact, for each $x \in [0, 1)$, there are precisely two points $y_1, y_2 \in [0, 1)$ such
that $c(x, y_1) < \infty$ and $c(x, y_2) < \infty$, while for all other $y \in [0, 1)$, we have $c(x, y) = \infty$.
In addition, there is an optimal transport plan $\hat\pi \in \Pi(\mu, \nu)$ whose support equals the set
$\{(x, y) \in [0, 1) \times [0, 1) : c(x, y) < \infty\}$.

In this example one may observe the following phenomenon: while there do not exist
Borel measurable functions $\hat\varphi : [0, 1) \to [-\infty, +\infty)$ and $\hat\psi : [0, 1) \to [-\infty, \infty)$ such that
$\hat\varphi(x) + \hat\psi(y) = c(x, y)$ on $\{c(x, y) < \infty\}$, there does exist a Borel function $\hat h : [0, 1) \times
[0, 1) \to [-\infty, \infty)$ such that $\hat h(x, y) = c(x, y)$ on $\{c(x, y) < \infty\}$ and such that $\hat h(x, y) =
\lim_{n \to \infty} (\varphi_n(x) + \psi_n(y))$, where $(\varphi_n, \psi_n)_{n=1}^\infty$ are properly chosen, bounded Borel functions.
The point is that the limit holds true (only) in the norm of $L^1([0, 1) \times [0, 1), \hat\pi)$ as well as
$\hat\pi$-a.s.

In other words, in this example we are able to identify some kind of dual optimizer
$\hat h \in L^1([0, 1) \times [0, 1), \hat\pi)$ which, however, is not of the form $\hat h(x, y) = \hat\varphi(x) + \hat\psi(y)$ for some
Borel functions $(\hat\varphi, \hat\psi)$, but only a $\hat\pi$-a.s. limit of such functions $(\varphi_n(x) + \psi_n(y))_{n=1}^\infty$.

In [BLS09, Theorem 4.2] we established a result which shows that much of the positive
aspect of this phenomenon, i.e. the existence of an optimal $\hat h \in L^1(\hat\pi)$, encountered in the
context of the above example, can be carried over to a general setting. For the convenience
of the reader we restate this theorem and the notations required to formulate it.

Fix a finite transport plan $\pi_0 \in \Pi(\mu, \nu, c) := \{ \pi \in \Pi(\mu, \nu) : \int_{X \times Y} c \, d\pi < \infty \}$. We denote
by $\Pi^{(\pi_0)}(\mu, \nu)$ the set of elements $\pi \in \Pi(\mu, \nu)$ such that $\pi \ll \pi_0$ and $\| \frac{d\pi}{d\pi_0} \|_{L^\infty(\pi_0)} < \infty$.

Note that $\Pi^{(\pi_0)}(\mu, \nu) = \Pi(\mu, \nu) \cap L^\infty(\pi_0) \subseteq \Pi(\mu, \nu, c)$. We shall replace the usual
Kantorovich optimization problem over the set $\Pi(\mu, \nu, c)$ by the optimization over the smaller
set $\Pi^{(\pi_0)}(\mu, \nu)$. Its value is

$$P^{(\pi_0)} = \inf \Big\{ \langle c, \pi \rangle = \int c \, d\pi : \pi \in \Pi^{(\pi_0)}(\mu, \nu) \Big\}. \tag{9}$$

As regards the dual problem, we define, for $\varepsilon > 0$,

$$D^{(\pi_0, \varepsilon)} = \sup \Big\{ \int \varphi \, d\mu + \int \psi \, d\nu : \varphi \in L^1(\mu),\ \psi \in L^1(\nu),\ \int_{X \times Y} \big( \varphi(x) + \psi(y) - c(x, y) \big)_+ \, d\pi_0 \le \varepsilon \Big\}$$

and

$$D^{(\pi_0)} = \lim_{\varepsilon \to 0} D^{(\pi_0, \varepsilon)}. \tag{10}$$

Define the "summing" map $S$ by

$$S : L^1(X, \mu) \times L^1(Y, \nu) \to L^1(X \times Y, \pi_0), \qquad (\varphi, \psi) \mapsto \varphi \oplus \psi,$$

where $\varphi \oplus \psi$ denotes the function $\varphi(x) + \psi(y)$ on $X \times Y$. Denote by $L^1_S(X \times Y, \pi_0)$ the
$\|\cdot\|_1$-closed linear subspace of $L^1(X \times Y, \pi_0)$ spanned by $S(L^1(X, \mu) \times L^1(Y, \nu))$. Clearly
$L^1_S(X \times Y, \pi_0)$ is a Banach space under the norm $\|\cdot\|_1$ induced by $L^1(X \times Y, \pi_0)$.

We shall also need the bi-dual $L^1_S(X \times Y, \pi_0)^{**}$ which may be identified with a subspace
of $L^1(X \times Y, \pi_0)^{**}$. In particular, an element $h \in L^1_S(X \times Y, \pi_0)^{**}$ can be decomposed into
$h = h^r + h^s$, where $h^r \in L^1(X \times Y, \pi_0)$ is the regular part of the finitely additive measure $h$
and $h^s$ its purely singular part.

Theorem 2.2. Let $c : X \times Y \to [0, \infty]$ be Borel measurable, and let $\pi_0 \in \Pi(\mu, \nu, c)$ be a
finite transport plan. We have

$$P^{(\pi_0)} = D^{(\pi_0)}.$$

There is an element $\hat h \in L^1_S(X \times Y, \pi_0)^{**}$ such that $\hat h \le c$ and

$$D^{(\pi_0)} = \langle \hat h, \pi_0 \rangle.$$

If $\pi \in \Pi^{(\pi_0)}(\mu, \nu)$ (identifying $\pi$ with $\frac{d\pi}{d\pi_0}$) satisfies $\int c \, d\pi \le P^{(\pi_0)} + \alpha$ for some $\alpha \ge 0$, then

$$|\langle \hat h^s, \pi \rangle| \le \alpha. \tag{12}$$

In particular, if $\pi$ is an optimizer of (9), then $\hat h^s$ vanishes on the set $\{ \frac{d\pi}{d\pi_0} > 0 \}$.
In addition, we may find a sequence of elements $(\varphi_n, \psi_n) \in L^1(\mu) \times L^1(\nu)$ such that

$$\varphi_n \oplus \psi_n \to \hat h^r, \ \pi_0\text{-a.s.}, \qquad \| (\varphi_n \oplus \psi_n - c)_+ \|_{L^1(\pi_0)} \to 0$$

and

$$\lim_{\delta \to 0} \ \sup_{A \subseteq X \times Y,\ \pi_0(A) < \delta} \ \lim_{n \to \infty} - \langle (\varphi_n \oplus \psi_n) 1_A, \pi_0 \rangle = \| \hat h^s \|_{L^1(\pi_0)^{**}}. \tag{13}$$

The assertion of the theorem extends the phenomenon of [BLS09, Example 4.1] to a
general setting. There is, however, one additional complication, as compared to the situation
of this specific example: in the above theorem we can only assert that we find the optimizer
$\hat h$ in $L^1(\hat\pi)^{**}$ rather than in $L^1(\hat\pi)$. The question arises whether this complication is indeed
unavoidable. The purpose of the subsequent section is to construct an example showing
that the phenomenon of a non-vanishing singular part $\hat h^s$ of $\hat h = \hat h^r + \hat h^s$ may indeed arise
in the above setting. In addition, the example gives a good illustration of the subtleties of
the situation described by the theorem above.



3 The singular part of the dual optimizer 

In this section we refine the construction of Examples 4.1 and 4.3 in [BLS09] (which in turn
are variants of an example due to L. Ambrosio and A. Pratelli [AP03, Example 3.2]). We
assume that the reader is familiar with these examples and freely use the notation from that
paper.

In particular, for an irrational $\alpha \in [0, 1)$ we write, for $k \in \mathbb{Z}$ (see footnote 1),

$$\varphi_k(x) = 1 + \#\{ 0 \le i < k : x \oplus i\alpha \in [0, \tfrac12) \} - \#\{ 0 \le i < k : x \oplus i\alpha \in [\tfrac12, 1) \}, \tag{14}$$

where, for $k < 0$, we mean by $0 \le i < k$ the set $\{k + 1, k + 2, \dots, 0\}$ and $\oplus$ denotes addition
modulo 1. We also recall that the function $h : [0, 1) \times [0, 1) \to \mathbb{Z} \cup \{\infty\}$ is defined in [BLS09,
Example 4.3] as

$$h(x, y) = \begin{cases} \varphi_k(x), & k \in \mathbb{Z} \text{ and } y = x \oplus k\alpha, \\ \infty, & \text{otherwise}. \end{cases} \tag{15}$$

Footnote 1: In [BLS09] the constructions are carried out for $\mathbb{N}$ instead of $\mathbb{Z}$, but for our purposes the latter choice
turns out to be better suited.

In [BLS09, Example 4.3] we considered the $[0, \infty]$-valued cost function $c(x, y) := h_+(x, y)$.
We now construct an example restricting $h_+(x, y)$ to a certain subset of $[0, 1) \times [0, 1)$.

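Example 3.1. There is an irrational $\alpha \in [0, 1)$ and a map $\tau : [0, 1) \to \mathbb{Z}$ such that, for

$$\Gamma_0 = \{ (x, x) : x \in [0, 1) \},$$
$$\Gamma_1 = \{ (x, x \oplus \alpha) : x \in [0, 1) \},$$
$$\Gamma_\tau = \{ (x, x \oplus \tau(x)\alpha) : x \in [0, 1) \}$$

and letting

$$c(x, y) = \begin{cases} h_+(x, y), & \text{for } (x, y) \in \Gamma_0 \cup \Gamma_1 \cup \Gamma_\tau, \\ \infty, & \text{otherwise}, \end{cases}$$

the following properties are satisfied.

(i) The maps

$$T_\alpha^0(x) = x, \qquad T_\alpha^1(x) = x \oplus \alpha, \qquad T_\alpha^{(\tau)}(x) = x \oplus (\tau(x)\alpha)$$

are measure preserving bijections from $[0, 1)$ to $[0, 1)$. Denote by $\pi_0$, $\pi_1$, $\pi_\tau$ the
corresponding transport plans in $\Pi(\mu, \nu)$, i.e.

$$\pi_0 = (\mathrm{id}, \mathrm{id})_\# \mu, \qquad \pi_1 = (\mathrm{id}, T_\alpha)_\# \mu, \qquad \pi_\tau = (\mathrm{id}, T_\alpha^{(\tau)})_\# \mu,$$

and let $\pi = (\pi_0 + \pi_1 + \pi_\tau)/3$.

(ii) The transport plans $\pi_0$ and $\pi_1$ are optimal while $\pi_\tau$ is not. In fact, we have

$$\langle c, \pi_0 \rangle = \langle c, \pi_1 \rangle = 1 \quad \text{while} \quad \langle c, \pi_\tau \rangle \ge \langle h, \pi_\tau \rangle > 1. \tag{16}$$

(iii) There is a sequence $(\varphi_n, \psi_n)_{n=1}^\infty$ of bounded Borel functions such that

$$\text{(a)} \quad \varphi_n(x) + \psi_n(y) \le c(x, y), \quad \text{for } x \in X,\ y \in Y, \tag{17}$$

$$\text{(b)} \quad \lim_{n \to \infty} \Big( \int_X \varphi_n(x) \, d\mu(x) + \int_Y \psi_n(y) \, d\nu(y) \Big) = 1, \tag{18}$$

$$\text{(c)} \quad \lim_{n \to \infty} \big( \varphi_n(x) + \psi_n(y) \big) = h(x, y), \quad \pi\text{-almost surely}. \tag{19}$$

(iv) Using the notation of [BLS09, Theorem 4.2] we find that for each dual optimizer
$\hat h \in L^1(\pi)^{**}$, which decomposes as $\hat h = \hat h^r + \hat h^s$ into its regular part $\hat h^r \in L^1(\pi)$ and its
purely singular part $\hat h^s \in L^1(\pi)^{**}$, we have

$$\hat h^r = h, \quad \pi\text{-a.s.}, \tag{20}$$

and the singular part $\hat h^s$ satisfies $\| \hat h^s \|_{L^1(\pi)^{**}} = \langle h, \pi_\tau \rangle - 1 > 0$. In particular, the
singular part $\hat h^s$ of $\hat h$ does not vanish. The finitely additive measure $\hat h^s$ is supported by
$\Gamma_\tau$, i.e. $\langle \hat h^s, 1_{\Gamma_0} + 1_{\Gamma_1} \rangle = 0$.

We shall use a special irrational $\alpha \in [0, 1)$, namely

$$\alpha = \sum_{j=1}^\infty \frac{1}{M_j},$$

where $M_j = m_1 m_2 \cdots m_j = M_{j-1} m_j$, and $(m_j)_{j=1}^\infty$ is a sequence of prime numbers $m_j \ge 5$
tending sufficiently fast to infinity, to be specified below. We let

$$\alpha_n = \sum_{j=1}^n \frac{1}{M_j},$$

which, of course, is a rational number.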

We will need the following lemma. We thank Leonhard Summerer for showing us the
proof of Lemma 3.2.

Lemma 3.2. It is possible to choose a sequence $m_1, m_2, \dots$ of primes growing arbitrarily
fast to infinity, such that with $M_1 = m_1$, $M_2 = m_1 \cdot m_2$, ..., $M_n = m_1 \cdots m_n$, ... we have,
for each $n \in \mathbb{N}$,

$$\sum_{j=1}^n \frac{1}{M_j} = \frac{P_n}{M_n}$$

with $P_n$ and $M_n$ relatively prime.

Proof. We have

$$\sum_{j=1}^n \frac{1}{M_j} = \frac{m_2 \cdots m_n + m_3 \cdots m_n + \dots + m_n + 1}{m_1 \cdots m_n} = \frac{P_n}{M_n},$$

thus $P_n$ and $M_n$ are relatively prime if and only if

$$m_1 \nmid m_2 \cdots m_n + m_3 \cdots m_n + \dots + m_n + 1 \tag{21}$$
$$m_2 \nmid m_3 \cdots m_n + \dots + m_n + 1 \tag{22}$$
$$\vdots \tag{23}$$
$$m_{n-1} \nmid m_n + 1. \tag{24}$$

We claim that these conditions are, e.g., satisfied provided that we choose $m_1, m_2, \dots$ such
that $m_1 \ge 3$ and

$$m_{i+1} \equiv +1 \pmod{m_i}, \tag{25}$$
$$m_{i+j} \equiv -1 \pmod{m_i} \quad \text{if } j \ge 2, \tag{26}$$

for all $i \ge 1$. Indeed (25), (26) imply that for $k \in \{1, \dots, n-1\}$ we have modulo $m_k$

$$m_{k+1} \cdots m_n + m_{k+2} \cdots m_n + m_{k+3} \cdots m_n + \dots + m_n + 1 \equiv (\pm 1) + (\pm 1) + (\mp 1) + \dots + (-1) + (+1),$$

where on the right-hand side the $(n - k + 1)$ summands start to alternate after the second term.
Thus, for even $n - k$, this amounts to

$$m_{k+1} \cdots m_n + m_{k+2} \cdots m_n + m_{k+3} \cdots m_n + \dots + m_n + 1 \equiv (-1) + (-1) + (+1) + \dots + (-1) + (+1) = -1,$$

while we obtain, for odd $n - k$,

$$m_{k+1} \cdots m_n + m_{k+2} \cdots m_n + m_{k+3} \cdots m_n + \dots + m_n + 1 \equiv (+1) + (+1) + (-1) + \dots + (-1) + (+1) = +2.$$

Hence (21)-(24) are satisfied as the $m_n$ were chosen such that $m_n > 2$.

We use induction to construct a sequence of primes satisfying (25) and (26). Assume that
$m_1, \dots, m_i$ have been defined. By the Chinese remainder theorem the system of congruences

$$x \equiv -1 \pmod{m_1}, \quad \dots, \quad x \equiv -1 \pmod{m_{i-1}}, \quad x \equiv +1 \pmod{m_i}$$

has a solution $x_0 \in \{1, \dots, m_1 \cdots m_i\}$. By Dirichlet's theorem, the arithmetic progression
$x_0 + k\, m_1 \cdots m_i$, $k \in \mathbb{N}$, contains infinitely many primes, so we may pick one which is as large
as we please. The induction continues. □
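The inductive construction in the proof can be carried out by machine for the first few primes. The sketch below is only an illustration: the helper names `is_prime`, `next_prime` and `build_primes`, the starting value $m_1 = 5$ and the rule of always taking the smallest admissible prime are assumptions of this example, whereas the proof allows each new prime to be as large as one pleases. The Chinese remainder solution is found by direct search, which is adequate for small moduli only.

```python
from fractions import Fraction


def is_prime(n):
    """Trial division; adequate for the small numbers used here."""
    if n < 2:
        return False
    d = 2
    while d * d <= n:
        if n % d == 0:
            return False
        d += 1
    return True


def next_prime(ms):
    """Given m_1..m_i, return a prime m_{i+1} > m_i with
    m_{i+1} = +1 (mod m_i) and m_{i+1} = -1 (mod m_j) for j < i."""
    modulus = 1
    for m in ms:
        modulus *= m
    # Solve the congruence system by direct search (Chinese remainder theorem).
    x0 = next(x for x in range(1, modulus + 1)
              if x % ms[-1] == 1 and all(x % m == m - 1 for m in ms[:-1]))
    # Walk along the arithmetic progression x0 + k * modulus until a prime appears.
    candidate = x0
    while candidate <= ms[-1] or not is_prime(candidate):
        candidate += modulus
    return candidate


def build_primes(m1, count):
    ms = [m1]
    while len(ms) < count:
        ms.append(next_prime(ms))
    return ms


primes = build_primes(5, 4)

# Partial sums alpha_n = sum_{j<=n} 1/M_j together with M_n.
partial_sums = []
total, M = Fraction(0), 1
for m in primes:
    M *= m
    total += Fraction(1, M)
    partial_sums.append((total, M))
```

Since `Fraction` reduces automatically, the statement that $P_n$ and $M_n$ are relatively prime is equivalent to the denominator of each partial sum being exactly $M_n$.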

For $\beta \in [0, 1)$, denote by $T_\beta : [0, 1) \to [0, 1)$, $T_\beta(x) := x \oplus \beta$, the addition of $\beta$ modulo 1.
With this notation we have $T_{\alpha_n}^{M_n} = \mathrm{id}$ and, by Lemma 3.2, it is possible to choose $m_1, \dots, m_n$
in such a way that $M_n$ is the smallest such number in $\mathbb{N}$. Our aim is to construct a function
$\tau : [0, 1) \to \mathbb{Z}$ such that the map

$$T_\alpha^{(\tau)} : [0, 1) \to [0, 1), \qquad x \mapsto T_\alpha^{(\tau)}(x) = T_\alpha^{\tau(x)}(x)$$

defines, up to a $\mu$-null set, a measure preserving bijection on $[0, 1)$, and such that the
corresponding transport plan $\pi_\tau \in \Pi(\mu, \nu)$, given by $\pi_\tau = (\mathrm{id}, T_\alpha^{(\tau)})_\# \mu$, has the properties
listed above with respect to the cost function $c(x, y)$ which is the restriction of the function
$h_+(x, y)$ to $\Gamma_0 \cup \Gamma_1 \cup \Gamma_\tau$. We shall do so by an inductive procedure, defining bounded
$\mathbb{Z}$-valued functions $\tau_n$ on $[0, 1)$ such that the maps $T_{\alpha_n}^{(\tau_n)}$ are measure preserving bijections on
$[0, 1)$. The map $T_\alpha^{(\tau)}$ then will be the limit of these $T_{\alpha_n}^{(\tau_n)}$.

Step n=1: Fix a prime $M_1 = m_1 \ge 5$, so that $\alpha_1 = \frac{1}{M_1}$. Define

$$I_{k_1} := \Big[ \frac{k_1 - 1}{M_1}, \frac{k_1}{M_1} \Big), \qquad k_1 = 1, \dots, M_1,$$

so that $(I_{k_1})_{k_1=1}^{M_1}$ forms a partition of $[0, 1)$ and $T_{\alpha_1}$ maps $I_{k_1}$ to $I_{k_1+1}$, with the convention
$M_1 + 1 = 1$. We also introduce the notations

$$L^1 := \Big[ 0, \frac12 - \frac{1}{2M_1} \Big) \quad \text{and} \quad R^1 := \Big[ \frac12 + \frac{1}{2M_1}, 1 \Big)$$

for the segments left and right of the middle interval

$$I^1_{\mathrm{middle}} := I_{(M_1+1)/2} = \Big[ \frac12 - \frac{1}{2M_1}, \frac12 + \frac{1}{2M_1} \Big).$$

We define the functions $\varphi^1, \psi^1$ on $[0, 1)$ such that $\varphi^1(x) + \psi^1(x) = 1$ and

$$\varphi^1(x) + \psi^1(T_{\alpha_1}(x)) = \begin{cases} 0, & x \in L^1, \\ 1, & x \in I^1_{\mathrm{middle}}, \\ 2, & x \in R^1, \end{cases}$$

which leads to the relation

$$\varphi^1(T_{\alpha_1}(x)) = \varphi^1(x) + \begin{cases} 1, & x \in L^1, \\ 0, & x \in I^1_{\mathrm{middle}}, \\ -1, & x \in R^1. \end{cases}$$

Making the choice $\varphi^1 = 0$ on $I_1$ this leads to

$$\varphi^1(x) = \begin{cases} k_1 - 1, & x \in I_{k_1},\ k_1 \in \{1, \dots, (M_1+1)/2\}, \\ M_1 + 1 - k_1, & x \in I_{k_1},\ k_1 \in \{(M_1+3)/2, \dots, M_1\}, \end{cases}$$

$$\psi^1(x) = 1 - \varphi^1(x).$$

The function $\varphi^1$ starts at 0, increases until the middle interval, stays constant when stepping
to the interval right of the middle, and then decreases, reaching 1 on the final interval $I_{M_1}$.
The idea is to define the map $\tau_1 : [0, 1) \to \mathbb{Z}$ in such a way that the map

$$T_{\alpha_1}^{(\tau_1)} : [0, 1) \to [0, 1), \qquad x \mapsto T_{\alpha_1}^{\tau_1(x)}(x)$$

is a measure preserving bijection enjoying the following property: the map

$$x \mapsto \varphi^1(x) + \psi^1\big( T_{\alpha_1}^{(\tau_1)}(x) \big)$$

equals the value two on a large set while it has concentrated a negative mass which is close
to $-1$ on a small set.

This can be done, e.g., by shifting the first interval $I_1$ to the interval $I_{(M_1-1)/2}$, which is left
of the middle one, while we shift the intervals $I_2, \dots, I_{(M_1-1)/2}$ by one interval to the left.
On the right hand side of $[0, 1)$ we proceed symmetrically while the middle interval simply
is not moved.




[Figure 1: graph not reproduced.]

Fig. 1. Representations of $\varphi^1$ and $\tau_1$.

The step function is $\varphi^1$ and the arrows indicate the action of $T_{\alpha_1}^{(\tau_1)}$. This figure corresponds
to the value $M_1 = 11$.



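More precisely, we set

$$\tau_1(x) = \begin{cases}
\frac{M_1 - 3}{2}, & x \in I_1, \\
-1, & x \in I_{k_1},\ k_1 \in \{2, \dots, (M_1-1)/2\}, \\
0, & x \in I_{(M_1+1)/2}, \\
1, & x \in I_{k_1},\ k_1 \in \{(M_1+3)/2, \dots, M_1 - 1\}, \\
-\frac{M_1 - 3}{2}, & x \in I_{M_1}.
\end{cases} \tag{28}$$

Then $T_{\alpha_1}^{(\tau_1)}$ induces a permutation of the intervals $(I_{k_1})_{k_1=1}^{M_1}$ and a short calculation shows
that

$$\varphi^1(x) + \psi^1\big( T_{\alpha_1}^{(\tau_1)}(x) \big) = \begin{cases}
2, & x \in I_{k_1},\ k_1 \in \{2, \dots, (M_1-1)/2,\ (M_1+3)/2, \dots, M_1 - 1\}, \\
-\frac{M_1 - 5}{2}, & x \in I_{k_1},\ k_1 = 1, M_1, \\
1, & x \in I_{(M_1+1)/2}.
\end{cases} \tag{29}$$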

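The next figure is a representation of this "quasi-cost" at level $n = 1$, with the same value
$M_1 = 11$ as in Figure 1.

[Figure 2: graph not reproduced; the plotted values are $2$, $1$ (on $I^1_{\mathrm{middle}}$) and $-\frac{M_1 - 5}{2}$.]

Fig. 2. Representation of $\varphi^1 + \psi^1 \circ T_{\alpha_1}^{(\tau_1)}$.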



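Assessment of Step n = 1. Let us summarize what we have achieved in the first induction step. For later use we formulate things only in terms of $\varphi^1(\cdot)$ rather than $\psi^1(\cdot) = 1 - \varphi^1(\cdot)$. For the set $J_1^g = \{2, \dots, \frac{M_1-1}{2}\} \cup \{\frac{M_1+3}{2}, \dots, M_1-1\}$ of "good"² indices we have

$$\varphi^1(x) - \varphi^1\big(T_{\alpha_1}^{(\tau_1)}(x)\big) = 1, \qquad x \in I_{k_1},\ k_1 \in J_1^g, \tag{30}$$

while for the set $J_1^s = \{1, M_1\}$ of "singular indices" we have

$$\varphi^1(x) - \varphi^1\big(T_{\alpha_1}^{(\tau_1)}(x)\big) = -\frac{M_1-3}{2}, \qquad x \in I_{k_1},\ k_1 \in J_1^s, \tag{31}$$

so that

$$\sum_{k_1 \in J_1^s} \int_{I_{k_1}} \big[\varphi^1(x) - \varphi^1\big(T_{\alpha_1}^{(\tau_1)}(x)\big)\big]\,dx = -\frac{M_1-3}{2}\cdot\frac{2}{M_1} = -1 + \frac{3}{M_1}.$$

For the middle interval $I^1_{\mathrm{middle}} = I_{(M_1+1)/2}$ we have $\varphi^1(x) - \varphi^1(T_{\alpha_1}^{(\tau_1)}(x)) = 0$.
We also note for later use that, for $x \in [0,1)$, the orbit $(T^i_{\alpha_1}(x))_{i=1}^{\tau_1(x)}$ never visits $I^1_{\mathrm{middle}}$. Here we mean that $i$ runs through $\{\tau_1(x), \tau_1(x)+1, \dots, -1\}$ when $\tau_1(x) < 0$ and runs through the empty set when $\tau_1(x) = 0$.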
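
The bookkeeping of step $n = 1$ can be replayed on a computer. The following is a minimal sketch, assuming that $\varphi^1$ vanishes on $I_1$ and is obtained by counting visits to the left and right halves as in the text; the function names `step_one`, `quasi_cost` and `singular_mass` are illustrative and do not come from the paper.

```python
from fractions import Fraction

def step_one(M1):
    """Return (phi, tau, target) on the intervals I_1, ..., I_{M1} (1-based keys).

    phi[k]    : value of phi^1 on I_k (assumed 0 on I_1, +1 per interval left of
                the middle, -1 per interval right of the middle)
    tau[k]    : value of tau_1 on I_k as in (28)
    target[k] : index of the interval onto which I_k is shifted by tau[k] steps
    """
    assert M1 % 2 == 1 and M1 >= 5
    mid = (M1 + 1) // 2
    phi, value = {}, 0
    for k in range(1, M1 + 1):
        phi[k] = value
        if k < mid:
            value += 1
        elif k > mid:
            value -= 1
    tau = {}
    for k in range(1, M1 + 1):
        if k == 1:
            tau[k] = (M1 - 3) // 2
        elif k < mid:
            tau[k] = -1
        elif k == mid:
            tau[k] = 0
        elif k < M1:
            tau[k] = 1
        else:
            tau[k] = -((M1 - 3) // 2)
    target = {k: (k - 1 + tau[k]) % M1 + 1 for k in phi}
    return phi, tau, target

def quasi_cost(M1):
    """phi^1(x) + psi^1(T^(tau_1)(x)) on each interval, with psi^1 = 1 - phi^1."""
    phi, _, target = step_one(M1)
    return {k: phi[k] + (1 - phi[target[k]]) for k in phi}

def singular_mass(M1):
    """Integral of phi^1 - phi^1 o T^(tau_1) over the singular intervals I_1, I_{M1}."""
    phi, _, target = step_one(M1)
    return sum(Fraction(phi[k] - phi[target[k]], M1) for k in (1, M1))
```

For $M_1 = 11$ this reproduces the values $2$, $1$ and $-(M_1-5)/2 = -3$ of (29) and the singular mass $-1 + 3/M_1 = -8/11$.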

Step n = 2: We now pass from $\alpha_1 = \frac{1}{M_1}$ to $\alpha_2 = \frac{1}{M_1} + \frac{1}{M_2}$, where $M_2 = M_1 m_2 = m_1 m_2$ and where $m_2$, to be specified below, satisfies the relations of Lemma 3.2 and is large compared to $M_1$. For $1 \le k_1 \le M_1$ and $1 \le k_2 \le m_2$ we denote by $I_{k_1,k_2}$ the interval

$$I_{k_1,k_2} = \Big[\frac{k_1-1}{M_1} + \frac{k_2-1}{M_2},\ \frac{k_1-1}{M_1} + \frac{k_2}{M_2}\Big).$$

² We use the term "good" rather than "regular" as the abbreviation r is already taken by the word "right".

Similarly as above we will also use the notations $L^2 = [0, \frac12 - \frac{1}{2M_2})$, $R^2 = [\frac12 + \frac{1}{2M_2}, 1)$, and

$$I^2_{\mathrm{middle}} = I_{(M_1+1)/2,\,(m_2+1)/2} = \Big[\frac12 - \frac{1}{2M_2},\ \frac12 + \frac{1}{2M_2}\Big).$$

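We now define functions $\varphi^2, \psi^2$ such that $\varphi^2(x) + \psi^2(x) = 1$ and

$$\varphi^2(x) + \psi^2(T_{\alpha_2}(x)) = \begin{cases} 0, & x \in L^2, \\ 1, & x \in I^2_{\mathrm{middle}}, \\ 2, & x \in R^2. \end{cases}$$

This is achieved if we set, e.g., $\varphi^2 = 0$ on $I_{1,1}$, and

$$\varphi^2(T_{\alpha_2}(x)) = \varphi^2(x) + \begin{cases} 1, & x \in L^2, \\ 0, & x \in I^2_{\mathrm{middle}}, \\ -1, & x \in R^2, \end{cases} \qquad \psi^2(x) = 1 - \varphi^2(x). \tag{32}$$

Yet another way to express this is to say that for $j \in \{0, \dots, M_2-1\}$ we have

$$\varphi^2(T^j_{\alpha_2}(x)) = \#\{i \in \{0, \dots, j-1\} : T^i_{\alpha_2}(x) \in L^2\} - \#\{i \in \{0, \dots, j-1\} : T^i_{\alpha_2}(x) \in R^2\}, \qquad x \in I_{1,1}, \tag{33}$$

in analogy to (14).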
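
The orbit count (33) is easy to evaluate exactly when positions are measured in units of $1/M_2$. The sketch below is only an illustration under that discretisation: interval $p \in \{0, \dots, M_2-1\}$ stands for $[p/M_2, (p+1)/M_2)$, so that $p = 0$ is $I_{1,1}$, and the helper name `phi_level_two` is hypothetical. The values $M_1 = 5$, $m_2 = 11$ in the usage note are the smallest ones mentioned in the paper's footnote on feasible $m_2$; the construction itself needs much larger $m_2$.

```python
from math import gcd

def phi_level_two(M1, m2):
    """Evaluate phi^2 on all M2 = M1*m2 intervals of length 1/M2 via (33).

    Returns (phi, step, middle) where phi[p] is the value on interval p,
    step is alpha_2 in units of 1/M2, and middle is the index of I^2_middle.
    """
    M2 = M1 * m2
    step = m2 + 1                      # alpha_2 = 1/M1 + 1/M2 = (m2 + 1)/M2
    assert gcd(step, M2) == 1          # the orbit of I_{1,1} then visits every interval once
    middle = (M2 - 1) // 2             # L^2 = indices below middle, R^2 = indices above
    phi, p, value = {}, 0, 0
    for _ in range(M2):
        phi[p] = value                 # number of L^2 visits minus R^2 visits so far
        if p < middle:
            value += 1
        elif p > middle:
            value -= 1
        p = (p + step) % M2            # apply T_{alpha_2}
    return phi, step, middle
```

With `phi, step, middle = phi_level_two(5, 11)` one can check on every interval that `phi[(p + step) % 55] - phi[p]` equals $+1$ on $L^2$, $0$ on the middle interval and $-1$ on $R^2$, which is the increment rule (32).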

While the function $\varphi^1(x)$ in the first induction step was increasing from $I_1$ to $I_{(M_1+1)/2}$ and then decreasing from $I_{(M_1+3)/2}$ to $I_{M_1}$, the function $\varphi^2(x)$ displays a similar feature on each of the intervals $I_{k_1}$: roughly speaking, i.e. up to terms controlled by $M_1$, it increases on the left half of each such interval and then decreases again on the right half. The next lemma makes this fact precise. We keep in mind, of course, that $m_2$ will be much bigger than $M_1$.

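Lemma 3.3 (Oscillations of $\varphi^2$). The function $\varphi^2$ defined in (32) has the following properties.

(i) $|\varphi^2(x) - \varphi^2(x \oplus \frac{1}{M_2})| \le 4M_1^2$, $x \in [0,1)$.

(ii) For each $1 \le k_1', k_1'' \le M_1$ we have
$$\varphi^2\big|_{I_{k_1'',\,(m_2+1)/2}} - \varphi^2\big|_{I_{k_1',\,1}} \ge \frac{m_2}{2M_1} - 10M_1^3.$$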
Proof. Let us begin with the proof of (i).

• Proof of (i). While $T^{M_1}_{\alpha_1} = \mathrm{id}$ holds true, we have that $T^{M_1}_{\alpha_2}$ is only close to the identity map. In fact, as $T_{\alpha_2}(x) = x \oplus (\frac{1}{M_1} + \frac{1}{M_2})$, we have

$$T^{M_1}_{\alpha_2}(x) = x \oplus \frac{M_1}{M_2}. \tag{34}$$

Somewhat less obvious is the fact that $T^{m_2-2}_{\alpha_2}$ also is close to the identity map. In fact

$$T^{m_2-2}_{\alpha_2}(x) = x \ominus \frac{2}{M_2}. \tag{35}$$

Indeed, by (25) applied to $i = 1$, there is $c \in \mathbb{N}$ such that $m_2 = cM_1 + 1$. Hence

$$T^{m_2-2}_{\alpha_2}(x) = x \oplus (m_2-2)\,\frac{m_2+1}{M_2} = x \oplus (cM_1-1)\,\frac{m_2+1}{M_2} = x \oplus \frac{cM_2 - m_2 + (m_2-2)}{M_2} = x \ominus \frac{2}{M_2}.$$

Here is one more remarkable feature of the map $T^{m_2-2}_{\alpha_2}$.
Claim: For $x \in [0,1)$ the orbit $(T^i_{\alpha_2}(x))_{i=1}^{m_2-2}$ visits the intervals $L^2 = [0, \frac12 - \frac{1}{2M_2})$ and $R^2 = [\frac12 + \frac{1}{2M_2}, 1)$ approximately equally often. More precisely, the difference of the visits of these two intervals is bounded in absolute value by $4M_1$.

Indeed, by Lemma 3.2, the orbit $(T^i_{\alpha_2}(x))_{i=1}^{M_2}$ visits each of the intervals $I_{k_1,k_2}$ exactly one time so that it visits $L^2$ and $R^2$ equally often, namely $\frac{M_2-1}{2}$ times. The $M_1$ many disjoint subsets $\big(T^{j(m_2-2)}_{\alpha_2}(T^i_{\alpha_2}(x))\big)_{i=1}^{m_2-2}$, $j = 0, \dots, M_1-1$, of this orbit are obtained by shifting them successively by $2/M_2$ to the left, by (35). As the difference between the full orbit $(T^i_{\alpha_2}(x))_{i=1}^{M_2}$ and the union of these $M_1$ subsets consists only of $2M_1$ many points, we have that the difference of the visits of this union to $L^2$ and $R^2$ is bounded by $4M_1$. This implies that the difference of the visits of $(T^i_{\alpha_2}(x))_{i=1}^{m_2-2}$ to $L^2$ and $R^2$ can be estimated by $4M_1$ too: indeed, if this orbit visits $L^2$ $4M_1 + k$ many times more often than $R^2$ (or vice versa) for some $k > 0$, then $\big(T^{m_2-2}_{\alpha_2}(T^i_{\alpha_2}(x))\big)_{i=1}^{m_2-2}$ visits $L^2$ at least $4M_1 + k - 4$ many times more often than $R^2$, etc., and finally $\big(T^{(M_1-1)(m_2-2)}_{\alpha_2}(T^i_{\alpha_2}(x))\big)_{i=1}^{m_2-2}$ visits $L^2$ at least $k$ many times more often than $R^2$, which yields a contradiction. Hence we have proved the claim.

To prove assertion (i) note that by (34) and (35)

$$T^{\frac{M_1-1}{2}(m_2-2)+M_1}_{\alpha_2}(x) = x \oplus \frac{1}{M_2}. \tag{36}$$

We deduce from the claim that the difference of the visits of the orbit $(T^i_{\alpha_2}(x))_{i=1}^{\frac{M_1-1}{2}(m_2-2)+M_1}$ to $L^2$ and $R^2$ is bounded in absolute value by $\frac{M_1-1}{2}(4M_1) + M_1$, which proves (i).
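
The three shift identities (34), (35) and (36) are statements about integers modulo $M_2$, so they can be checked mechanically. The following sketch assumes only the relation $m_2 = cM_1 + 1$ quoted from (25); the helper name `shift_after` is made up for this illustration.

```python
def shift_after(iterations, M1, m2):
    """Net shift of T_{alpha_2}^iterations in units of 1/M2.

    alpha_2 = (m2 + 1)/M2, so the shift is iterations*(m2 + 1) modulo M2.
    The result is reported as a signed residue in (-M2/2, M2/2].
    """
    M2 = M1 * m2
    r = (iterations * (m2 + 1)) % M2
    return r - M2 if r > M2 // 2 else r

def check_identities(M1, m2):
    """Return the shifts appearing in (34), (35), (36), in units of 1/M2."""
    assert (m2 - 1) % M1 == 0          # m2 = c*M1 + 1, as quoted from (25)
    return (
        shift_after(M1, M1, m2),                                # (34): expected M1
        shift_after(m2 - 2, M1, m2),                            # (35): expected -2
        shift_after((M1 - 1) // 2 * (m2 - 2) + M1, M1, m2),     # (36): expected 1
    )
```

For example `check_identities(5, 11)` returns `(5, -2, 1)`: the three iterates shift by $M_1/M_2$, $-2/M_2$ and $1/M_2$ respectively.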

• Proof of (ii). As regards (ii) suppose first fcj = fc" =: fei. Note that, for x 6 /[ oft := 

+ 2MT - we have that the orbit i T U x ))^ visits L 2 one time more 

often than R 2 , namely Ml 2 +1 versus Ml 2 ~ 1 times. If we start with x £ 7fe li i then, for 

1 < j < 535^ - 1 we have that T^{x) £ ij^. Hence, for the orbit (T^L 2 / 577 ^^ 1 , 
the difference of the visits to the interval L 2 and R 2 equals Ltm"J — 1; the integer part of 



1112 



25ii ~~ ^ ' Combining this estimate with the estimate (i) as well as the fact that the distance 

between x © (LlgrJ ~ *) jfe and x ® l^ST is bounded b y 2j ^ 2 ~ 1 , we obtain, for x € / fel ,i 
and y £ I m 2 +i , that 



fci, 2 



rt)-^(*)>^(T« 1 (x))-^(x)- ^(2/)-^(T a r Ul (x)) 

>(L2Tf7J-l)-(2Mi-l)(4M 2 ) 



-2Mi 

»2 

2 Mi 



> "2 _ o il .f 3 



Passing to the general case 1 < k[ , fc" < Mi observe that kl maps / ( m2 +i to 

fen 2 

I m 2 +i , „ , , • Using again (i) we obtain estimate (ii) . □ 

We are now ready to do the inductive construction for $n = 2$. For $m_2$ satisfying the conditions of Lemma 3.2 and to be specified below, we shall define $\tau_2 : [0,1) \to \{-M_2+1, \dots, 0, \dots, M_2-1\}$, where $M_2 = m_1 m_2$, such that the map

$$T^{(\tau_2)}_{\alpha_2} : [0,1) \to [0,1), \qquad x \mapsto T^{(\tau_2)}_{\alpha_2}(x) := T^{\tau_2(x)}_{\alpha_2}(x)$$

has the following properties.



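(i) The measure-preserving bijection $T^{(\tau_2)}_{\alpha_2} : [0,1) \to [0,1)$ maps each interval $I_{k_1}$ onto $T^{(\tau_1)}_{\alpha_1}(I_{k_1})$. It induces a permutation of the intervals $I_{k_1,k_2}$, where $1 \le k_1 \le M_1$, $1 \le k_2 \le m_2$.

(ii) When $\tau_2(x) > 0$, we have

$$T^i_{\alpha_2}(x) \notin I^2_{\mathrm{middle}}, \qquad i = 0, \dots, \tau_2(x), \tag{37}$$

and, when $\tau_2(x) < 0$, we have

$$T^i_{\alpha_2}(x) \notin I^2_{\mathrm{middle}}, \qquad i = \tau_2(x), \dots, 0. \tag{38}$$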

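(iii) On the "good" intervals $I_{k_1}$, where $k_1 \in J_1^g = \{2, \dots, \frac{M_1-1}{2}\} \cup \{\frac{M_1+3}{2}, \dots, M_1-1\}$, for which we have, by (30),

$$\varphi^1(x) - \varphi^1\big(T^{(\tau_1)}_{\alpha_1}(x)\big) = 1,$$

the function $\tau_2$ will satisfy the estimates

$$\mu\big[I_{k_1} \cap \{\tau_1 \ne \tau_2\}\big] \le \frac{1}{m_2}\,\mu[I_{k_1}], \tag{39}$$

and

$$\sum_{k_1 \in J_1^g} \int_{I_{k_1}} \big|1 - \varphi^2(x) + \varphi^2\big(T^{(\tau_2)}_{\alpha_2}(x)\big)\big|\,dx \le \frac{4M_1^2}{m_2}. \tag{40}$$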

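(iv) On the "singular" intervals $I_{k_1}$, where $k_1 \in J_1^s = \{1, M_1\}$, for which we have, by (31),

$$\varphi^1(x) - \varphi^1\big(T^{(\tau_1)}_{\alpha_1}(x)\big) = -\frac{M_1-3}{2},$$

we split $\{1, \dots, m_2\}$ into a set $J^{k_1,g}$ of "good" indices, and a set $J^{k_1,s}$ of "singular" indices, such that

$$\varphi^2(x) - \varphi^2\big(T^{(\tau_2)}_{\alpha_2}(x)\big) = 0, \qquad \text{for } x \in I_{k_1,k_2},\ k_2 \in J^{k_1,g},$$

while

$$\varphi^2(x) - \varphi^2\big(T^{(\tau_2)}_{\alpha_2}(x)\big) \le -\frac{m_2}{2M_1} + 20M_1^4, \qquad \text{for } x \in I_{k_1,k_2},\ k_2 \in J^{k_1,s},$$

where $J^{k_1,s}$ consists of $M_1(M_1-3)$ many elements of $\{1, \dots, m_2\}$.
Hence we have a total "singular mass" of

$$\sum_{k_1 \in J_1^s} \sum_{k_2 \in J^{k_1,s}} \int_{I_{k_1,k_2}} \big[\varphi^2(x) - \varphi^2\big(T^{(\tau_2)}_{\alpha_2}(x)\big)\big]\,dx \le -1 + \frac{3}{M_1} + \frac{c(M_1)}{m_2}, \tag{41}$$

where $c(M_1)$ is a constant depending only on $M_1$.

(v) On the middle interval $I^1_{\mathrm{middle}} = I_{(M_1+1)/2}$ we simply let $\tau_2 = \tau_1 = 0$.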

Let us illustrate graphically an interesting property of this construction, namely the shape of the quasi-cost function $\varphi^2 + \psi^2 \circ T^{(\tau_2)}_{\alpha_2}$.

Fig. 3. Shape of the quasi-cost $\varphi^2 + \psi^2 \circ T^{(\tau_2)}_{\alpha_2}$.

The strips in this graphic representation symbolize the oscillations of the function $\varphi^2 + \psi^2 \circ T^{(\tau_2)}_{\alpha_2}$. On the "singular" set, it achieves values of order $-M_2/M_1^2$.

It will sometimes be more convenient to specify to which interval $I_{l_1,l_2}$ the interval $I_{k_1,k_2}$ is mapped under $T^{(\tau_2)}_{\alpha_2}$ instead of spelling out the value of $\tau_2$ on the interval $I_{k_1,k_2}$. Note that by Lemma 3.2, for each map associating to $(k_1,k_2)$ a pair $(l_1,l_2)$, there corresponds precisely one value $\tau_2|_{I_{k_1,k_2}} \in \{-M_2+1, \dots, 0, \dots, M_2-1\}$ such that (37) (resp. (38)) is satisfied and $T^{(\tau_2)}_{\alpha_2}(I_{k_1,k_2}) = I_{l_1,l_2}$.

Let us start with a "good" interval $I_{k_1}$, with $k_1 \in J_1^g$ as in (iii) above, say $k_1 \in \{2, \dots, \frac{M_1-1}{2}\}$, for which we have $\tau_1(x) = -1$. Then the intervals $I_{k_1,2}, \dots, I_{k_1,m_2}$ are mapped under $T^{\tau_1(x)}_{\alpha_2} = T^{-1}_{\alpha_2}$ onto the intervals $I_{k_1-1,1}, \dots, I_{k_1-1,m_2-1}$. Defining $\tau_2(x) = \tau_1(x)$ on these intervals we get for $x \in I_{k_1,k_2}$, where $2 \le k_1 \le \frac{M_1-1}{2}$, $2 \le k_2 \le m_2$,

$$1 = \varphi^1(x) - \varphi^1\big(T^{(\tau_1)}_{\alpha_1}(x)\big) = \varphi^2(x) - \varphi^2\big(T^{(\tau_2)}_{\alpha_2}(x)\big). \tag{42}$$

We still have to define the value of $\tau_2(x)$, for $x \in I_{k_1,1}$. The map $T^{(\tau_2)}_{\alpha_2}$ has to map $I_{k_1,1}$ to the remaining gap $I_{k_1-1,m_2}$, which happens to be its left neighbour. We do not explicitly calculate the unique number $\tau_2|_{I_{k_1,1}} \in \{-M_2+1, \dots, M_2-1\}$, satisfying (37) (resp. (38)), which does the job, but only use the conclusion of Lemma 3.3 to find that, for $x \in I_{k_1,1}$ such that $T^{(\tau_2)}_{\alpha_2}(x) \in I_{k_1-1,m_2}$,

$$\big|1 - \big[\varphi^2(x) - \varphi^2\big(T^{(\tau_2)}_{\alpha_2}(x)\big)\big]\big| \le 4M_1^2 + 1. \tag{43}$$

This takes care of the "good" intervals $I_{k_1}$, where $k_1 \in \{2, \dots, \frac{M_1-1}{2}\}$.

Fig. 4-a. $k_1 \in J_1^g$ on the left side.³

For the "good" intervals $I_{k_1}$, where $k_1 \in \{\frac{M_1+3}{2}, \dots, M_1-1\}$, we have $\tau_1(x) = 1$ so that $T^{\tau_1(x)}_{\alpha_2}$ maps the intervals $I_{k_1,1}, \dots, I_{k_1,m_2-1}$ to $I_{k_1+1,2}, \dots, I_{k_1+1,m_2}$. Again we define $\tau_2(x) = \tau_1(x) = 1$, for $x$ in these intervals, so that we obtain the identity (42), for $\frac{M_1+3}{2} \le k_1 \le M_1-1$ and $1 \le k_2 \le m_2-1$. Finally, $T^{(\tau_2)}_{\alpha_2}$ has to map $I_{k_1,m_2}$ to the interval $I_{k_1+1,1}$ so that again we derive an estimate as in (43).

Fig. 4-b. $k_1 \in J_1^g$ on the right side.

This finishes item (iii), i.e. the definition of $\tau_2$ on the "good" intervals $I_{k_1}$. Noting that on this set we have $\tau_1 \ne \tau_2$ only on $M_1 - 3$ many intervals of length $\frac{1}{M_2}$ we obtain the estimate (40).

To show (iv) let us first consider the "singular" interval $I_1$, on which we have $\tau_1(x) = \frac{M_1-3}{2}$ and $\varphi^1(T^{(\tau_1)}_{\alpha_1}(x)) = \varphi^1(T^{(\tau_1)}_{\alpha_1}(x)) - \varphi^1(x) = \frac{M_1-3}{2}$. For the subintervals $I_{1,k_2}$ of $I_1$, define the set of good indices as $J^{1,g} = J^{1,g,l} \cup J^{1,g,r}$ where

$$J^{1,g,l} = \Big\{\tfrac{(M_1-3)(M_1-1)}{2} + 1, \dots, \tfrac{m_2-1}{2}\Big\}, \qquad J^{1,g,r} = \Big\{\tfrac{m_2+1}{2}, \dots, m_2 - \tfrac{(M_1-3)(M_1+1)}{2}\Big\}.$$

Let us start by considering $k_2 \in J^{1,g,r}$. We define

$$\tau_2(x) = \tau_1(x) + \frac{M_1-3}{2}\,M_1 = \frac{(M_1-3)(M_1+1)}{2}, \qquad x \in I_{1,k_2},\ k_2 \in J^{1,g,r}.$$

First note that $T^{(\tau_2)}_{\alpha_2}$ then maps the intervals $I_{1,k_2}$, for $k_2 \in J^{1,g,r}$, to the intervals

$$I_{\frac{M_1-1}{2},\ \frac{m_2+1}{2} + \frac{(M_1-3)(M_1+1)}{2}},\ \dots,\ I_{\frac{M_1-1}{2},\ m_2}.$$

Observe that, for $x$ as above, the orbit $(T^i_{\alpha_2}(x))_{i=0}^{\tau_2(x)-1}$ always lies in the right halves of the respective intervals $I_{k_1}$.

Let us count how often the orbit $(T^i_{\alpha_2}(x))_{i=0}^{\tau_2(x)-1}$ visits $L^2$ and $R^2$ respectively, for $x \in I_{1,k_2}$ and $k_2 \in J^{1,g,r}$. The first $\tau_1(x) = \frac{M_1-3}{2}$ elements of this orbit are all in $L^2$ which yields, similarly as in the induction step $n = 1$,

$$\varphi^2(x) - \varphi^2\big(T^{\tau_1(x)}_{\alpha_2}(x)\big) = -\frac{M_1-3}{2}.$$

But the next $M_1$ many elements of this orbit, namely

$$\big(T^i_{\alpha_2}(x)\big)_{i=\tau_1(x)}^{\tau_1(x)+M_1-1},$$

visit $R^2$ one time more often than $L^2$ as the unique element of this orbit which lies in $I^1_{\mathrm{middle}}$ belongs to the right half of $I^1_{\mathrm{middle}}$.

This phenomenon repeats on the orbit $(T^i_{\alpha_2}(x))_{i=\tau_1(x)}^{\tau_1(x)+\frac{M_1-3}{2}M_1-1}$ for $\frac{M_1-3}{2}$ many times so that

$$\varphi^2(x) - \varphi^2\big(T^{(\tau_2)}_{\alpha_2}(x)\big) = \big[\varphi^2(x) - \varphi^2\big(T^{\tau_1(x)}_{\alpha_2}(x)\big)\big] + \big[\varphi^2\big(T^{\tau_1(x)}_{\alpha_2}(x)\big) - \varphi^2\big(T^{(\tau_2)}_{\alpha_2}(x)\big)\big] = -\frac{M_1-3}{2} + \frac{M_1-3}{2} = 0, \tag{44}$$

for $x \in I_{1,k_2}$ and $k_2 \in J^{1,g,r}$.

This takes care of $I_{1,k_2}$ with $k_2 \in J^{1,g,r}$.

³ Figure 3 is built with the small value $m_2 = 7$ for the sake of clarity of the drawing. But this value is not feasible since with the lowest $m_1 = 5$, (25) implies that $m_2$ is at least equal to 11; other requirements of the construction imply that it has to be even larger.

For $x \in I_{1,k_2}$ with $k_2 \in J^{1,g,l}$, the left half of the "good" intervals, we define symmetrically

$$\tau_2(x) = \tau_1(x) - \frac{M_1-3}{2}\,M_1 = -\frac{(M_1-3)(M_1-1)}{2}.$$

A similar analysis as above shows that $T^{(\tau_2)}_{\alpha_2}$ maps the intervals $I_{1,k_2}$, where $k_2 \in J^{1,g,l}$, to the intervals $I_{\frac{M_1-1}{2},\,1}, \dots, I_{\frac{M_1-1}{2},\ \frac{m_2-1}{2} - \frac{(M_1-3)(M_1-1)}{2}}$. Hence by a symmetric reasoning we again obtain equality (44) for $x$ in the intervals $I_{1,k_2}$ with $k_2 \in J^{1,g,l}$ too.

Now we have to deal with the "singular" subintervals $I_{1,k_2}$, where $k_2 \in J^{1,s}$, and the singular indices are given by

$$J^{1,s} = \{1, \dots, m_2\} \setminus J^{1,g} = \Big\{1, \dots, \tfrac{(M_1-3)(M_1-1)}{2}\Big\} \cup \Big\{m_2 - \tfrac{(M_1-3)(M_1+1)}{2} + 1, \dots, m_2\Big\},$$

which consists of $M_1(M_1-3)$ many indices.
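
The size of $J^{1,s}$ follows from $\frac{(M_1-3)(M_1-1)}{2} + \frac{(M_1-3)(M_1+1)}{2} = M_1(M_1-3)$. As a quick illustration, the index sets can be enumerated directly; the function name `index_sets` is hypothetical, and the sample value $m_2 = 101$ is chosen only to keep the sets small (the construction itself requires a far larger $m_2$).

```python
def index_sets(M1, m2):
    """Good (left/right) and singular indices k_2 for the singular interval I_1."""
    a = (M1 - 3) * (M1 - 1) // 2       # number of singular indices at the left end
    b = (M1 - 3) * (M1 + 1) // 2       # number of singular indices at the right end
    good_left = set(range(a + 1, (m2 - 1) // 2 + 1))
    good_right = set(range((m2 + 1) // 2, m2 - b + 1))
    singular = set(range(1, m2 + 1)) - good_left - good_right
    return good_left, good_right, singular
```

For instance, `index_sets(5, 101)` gives the singular set $\{1, \dots, 4\} \cup \{96, \dots, 101\}$, which has $M_1(M_1-3) = 10$ elements.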

The map $T^{(\tau_2)}_{\alpha_2}$ has to map these intervals $I_{1,k_2}$, where $k_2 \in J^{1,s}$, to the "remaining gaps" $I_{\frac{M_1-1}{2},\,l_2}$ in the interval $I_{\frac{M_1-1}{2}}$, where $l_2 \in \{\frac{m_2+1}{2} - \frac{(M_1-3)(M_1-1)}{2}, \dots, \frac{m_2+1}{2} + \frac{(M_1-3)(M_1+1)}{2} - 1\}$. Note that the corresponding intervals $I_{\frac{M_1-1}{2},\,l_2}$ are, roughly speaking, in the middle of the interval $I_{\frac{M_1-1}{2}}$, while the intervals $I_{1,k_2}$, with $k_2 \in J^{1,s}$, are at the boundary of $I_1$.

To define $\tau_2$ on $I_{1,k_2}$, for $k_2 \in J^{1,s}$, choose any function $\tau_2$ taking values in $\{-M_2+1, \dots, M_2-1\}$, satisfying (37) (resp. (38)) as above, which induces a bijection between the intervals $(I_{1,k_2})_{k_2 \in J^{1,s}}$ and the intervals $I_{\frac{M_1-1}{2},\,l_2}$ considered above.

Fig. 5. $\tau_2$ for the "singular" indices on the left side.

In this drawing, the interval $I^{1,g,l}$ is the union of the intervals $I_{1,k_2}$ with $k_2 \in J^{1,g,l}$. A similar convention holds for $I^{1,g,r}$ and $I^{1,s}$ (which is not an interval anymore).

For each such $\tau_2$ we obtain, for $x \in I_{1,k_2}$, $k_2 \in J^{1,s}$, from Lemma 3.3

$$\varphi^2(x) - \varphi^2\big(T^{(\tau_2)}_{\alpha_2}(x)\big) \le -\frac{m_2}{2M_1} + 10M_1^3 + 2\,\frac{(M_1-3)(M_1+1)}{2}\,4M_1^2 \le -\frac{m_2}{2M_1} + 20M_1^4. \tag{45}$$

Indeed, the leading term $\frac{m_2}{2M_1}$ and the first error term $10M_1^3$ in the first line above come from Lemma 3.3-(ii) when comparing the value of $\varphi^2$ on the interval $I_{1,1}$ to that on $I_{\frac{M_1-1}{2},\,\frac{m_2+1}{2}}$. For the difference of the value of $\varphi^2$ on $I_{1,k_2}$ and $I_{\frac{M_1-1}{2},\,l_2}$, for arbitrary $k_2 \in J^{1,s}$ and $l_2 \in \{\frac{m_2+1}{2} - \frac{(M_1-3)(M_1-1)}{2}, \dots, \frac{m_2+1}{2} + \frac{(M_1-3)(M_1+1)}{2} - 1\}$, we apply for both cases at most $\frac{(M_1-3)(M_1+1)}{2}$ times estimate (i) of Lemma 3.3, which gives (45).
In particular, for $m_2 > 40M_1^5$, which of course we shall assume, we have that

$$\varphi^2(x) - \varphi^2\big(T^{(\tau_2)}_{\alpha_2}(x)\big) < 0, \qquad \text{for } x \in I_{1,k_2},\ k_2 \in J^{1,s}.$$

There are $M_1(M_1-3) = M_1^2 - 3M_1$ many intervals $I_{1,k_2}$ with $k_2 \in J^{1,s}$, each of length $1/M_2$. Hence we may estimate the "singular mass" on the interval $I_1$ by

$$\sum_{k_2 \in J^{1,s}} \int_{I_{1,k_2}} \big[\varphi^2(x) - \varphi^2\big(T^{(\tau_2)}_{\alpha_2}(x)\big)\big]\,dx \le \Big(-\frac{m_2}{2M_1} + 20M_1^4\Big)(M_1^2 - 3M_1)\frac{1}{M_2} \le -\frac12 + \frac{3}{2M_1} + \frac{c(M_1)}{2m_2}, \tag{46}$$

where $c(M_1)$ is a constant depending on $M_1$ only.⁴

We still have another "singular" interval at the present induction step $n = 2$, namely $I_{M_1}$. The analysis for this case is symmetric to the analysis of $I_1$ and, after properly defining $\tau_2$ on this interval $I_{M_1}$, we arrive at the same estimate (46). In total, we thus obtain (41) by doubling the right hand side of (46), showing that the "singular mass" essentially equals $-1$.

Finally define the sets $J_2^g$ (resp. $J_2^s$) of "good" (resp. "singular") indices at level 2 as

$$J_2^g = \{(k_1,k_2) : (k_1 \in J_1^g \text{ and } 1 \le k_2 \le m_2), \text{ or } (k_1 \in J_1^s \text{ and } k_2 \in J^{k_1,g})\},$$
$$J_2^s = \{(k_1,k_2) : k_1 \in J_1^s \text{ and } k_2 \in J^{k_1,s}\}.$$

This finishes the inductive step for $n = 2$.

General inductive step. Suppose that the prime numbers $m_1, \dots, m_{n-1}$ have been defined. We use the notation $\alpha_{n-1} = \frac{1}{M_1} + \dots + \frac{1}{M_{n-1}}$, where $M_{n-1} = m_1 \cdot m_2 \cdot \ldots \cdot m_{n-1}$. For a prime $m_n$ satisfying the condition of Lemma 3.2, and to be specified below, let $M_n = m_1 \cdot \ldots \cdot m_n$ and

$$L^n = \Big[0, \frac12 - \frac{1}{2M_n}\Big), \qquad R^n = \Big[\frac12 + \frac{1}{2M_n}, 1\Big), \qquad I^n_{\mathrm{middle}} = \Big[\frac12 - \frac{1}{2M_n}, \frac12 + \frac{1}{2M_n}\Big).$$

⁴ We shall find it convenient in the sequel to write $c(M_1, M_2, \dots, M_i)$ for constants depending only on the choice of the numbers $M_1, M_2, \dots, M_i$. The concrete numerical value of this expression may change, i.e. become bigger, from one line of reasoning to the next one, but at every stage it will be clear that an explicit bound for the respective meaning of the constant $c(M_1, M_2, \dots, M_i)$ could be given, at least in principle. In fact, we shall always have that the constants $c(M_1, M_2, \dots, M_i)$ used in the sequel are dominated by a polynomial in the variables $M_1, M_2, \dots, M_i$.

For $1 \le k_1 \le m_1, \dots, 1 \le k_n \le m_n$, let

$$I_{k_1,\dots,k_n} = \Big[\frac{k_1-1}{M_1} + \frac{k_2-1}{M_2} + \dots + \frac{k_n-1}{M_n},\ \frac{k_1-1}{M_1} + \frac{k_2-1}{M_2} + \dots + \frac{k_n}{M_n}\Big).$$

For $x \in I_{1,\dots,1}$ and $j \in \{0, \dots, M_n\}$ we define, similarly as in (33), $\varphi^n(x) = 0$ and

$$\varphi^n(T^j_{\alpha_n}(x)) = \#\{i \in \{0, \dots, j-1\} : T^i_{\alpha_n}(x) \in L^n\} - \#\{i \in \{0, \dots, j-1\} : T^i_{\alpha_n}(x) \in R^n\}, \tag{47}$$

where $\alpha_n = \alpha_{n-1} + \frac{1}{M_n}$ and $M_n = M_{n-1} m_n$. We also let $\psi^n(x) = 1 - \varphi^n(x)$, for $x \in [0,1)$.

Lemma 3.4 (Oscillations of $\varphi^n$). For given $M_1, \dots, M_{n-1}$ there is a constant $c(M_1, \dots, M_{n-1})$ depending only on $M_1, \dots, M_{n-1}$, such that for all $m_n$ as above we have

(i) $|\varphi^n(x) - \varphi^n(x \oplus \frac{1}{M_n})| \le c(M_1, \dots, M_{n-1})$,

(ii) for each $1 \le k_1', k_1'' \le M_1, \dots, 1 \le k_{n-1}', k_{n-1}'' \le m_{n-1}$,

$$\varphi^n\big|_{I_{k_1'',\dots,k_{n-1}'',\,(m_n+1)/2}} - \varphi^n\big|_{I_{k_1',\dots,k_{n-1}',\,1}} \ge \frac{m_n}{2M_{n-1}} - c(M_1, \dots, M_{n-1}),$$

(iii) for each $1 \le k_1', k_1'' \le M_1, \dots, 1 \le k_{n-1}', k_{n-1}'' \le m_{n-1}$, and $1 \le k_n', k_n'' \le m_n$, with $\min\{k_n', m_n - k_n'\} \le M_{n-1}$ and $\min\{k_n'', m_n - k_n''\} \le M_{n-1}$ we have

$$\Big|\varphi^n\big|_{I_{k_1',\dots,k_{n-1}',k_n'}} - \varphi^n\big|_{I_{k_1'',\dots,k_{n-1}'',k_n''}}\Big| \le c(M_1, \dots, M_{n-1}). \tag{48}$$

Proof. We may and do assume that $m_n > 5M_{n-1}$.

• Proof of (i). We have $T_{\alpha_n}(x) = T_{\alpha_{n-1}}(T_{1/M_n}(x))$ so that

$$T^{M_{n-1}}_{\alpha_n}(x) = x \oplus \frac{M_{n-1}}{M_n} = x \oplus \frac{1}{m_n}, \tag{49}$$

in perfect analogy to (34). As regards the analogue to (35) things now are somewhat more complicated. First note that there is a unique number $1 \le q_{n-1} \le M_{n-1} - 1$ such that

$$T^{q_{n-1}}_{\alpha_{n-1}}(x) = x \ominus \frac{1}{M_{n-1}}, \qquad x \in [0,1). \tag{50}$$

Indeed, by Lemma 3.2, when $q_{n-1}$ runs through $\{1, \dots, M_{n-1}-1\}$, the left hand side assumes the values $x \ominus \frac{l_{n-1}}{M_{n-1}}$, where $l_{n-1}$ also runs through $\{1, \dots, M_{n-1}-1\}$.

Claim: Letting $r_n = \lfloor \frac{m_n}{M_{n-1}} \rfloor$, the integer part of $\frac{m_n}{M_{n-1}}$, and taking $q_{n-1}$ as in (50), we have

$$T^{r_n M_{n-1} + q_{n-1}}_{\alpha_n}(x) = x \oplus \frac{d_{n-1}}{M_n},$$

where $|d_{n-1}| < M_{n-1}$.

Indeed, write $m_n$ as $m_n = r_n M_{n-1} + e_{n-1}$, for some $1 \le e_{n-1} < M_{n-1}$, to obtain

$$T^{r_n M_{n-1} + q_{n-1}}_{\alpha_n}(x) = \big(T^{r_n M_{n-1}}_{\alpha_n} \circ T^{q_{n-1}}_{\alpha_{n-1}} \circ T^{q_{n-1}}_{1/M_n}\big)(x) = x \oplus r_n \frac{M_{n-1}}{M_n} \ominus \frac{1}{M_{n-1}} \oplus \frac{q_{n-1}}{M_n} = x \oplus \frac{q_{n-1} - e_{n-1}}{M_n},$$

which proves the claim.

Define s^ 1 ^ = q n -i if d n -i = q n -i — e n -i > and s| l 1 2 1 = g„_i + M„_i otherwise, to obtain 
by (49) and (50) that 

r,Jf,-i+«l'li l^li 

Ta n (x) = x (B Mn , 

for some € {1, . . . , M„_i}. We also deduce from (49) that must actually be in 

(2) (2) 

Repeat the above argument to find s y n _ 1 with — 2M„_i < s y n _ 1 < 2M„_i such that 



for some l n _i G {1, . . . , M n _i — 1}. Continuing in the same way, we find numbers s„_i, for 

(i) 

j = 1,2,..., M„_i - 1 verifying -jM„_i < < jM„_i such that 

T ^nM„_ 1+Sn _ 1(a;)=a;e ^ 5 (51) 



for some Z^2i € {1, . . . , M tt _i — 1}. Note that, under the assumption m n 3> M„_i so that 

r„ ^> M„_i, the elements in (51) are all different. Therefore (^.i)^" -1-1 runs through all 
elements of {1, ... , M n _i — 1} when j runs through {1, . . . , M„_i — 1}; in particular there 
must be some jo such that 

T a „ (x) — x -p^-, 

in analogy to (36). 

Now observe that there is a constant $c(M_1, \dots, M_{n-1})$, depending only on $M_1, \dots, M_{n-1}$, such that, for $x \in [0,1)$, the difference of the number of visits of the orbit $(T^i_{\alpha_n}(x))_{i=1}^{r_n M_{n-1} + q_{n-1}}$ to $L^n$ and $R^n$ is bounded in absolute value by the constant $c(M_1, \dots, M_{n-1})$. The argument is analogous to the corresponding one in the proof of the claim which is part of the proof of Lemma 3.3-(i), and therefore skipped.

The numbers $j_0$ as well as $s^{(j_0)}_{n-1}$ are bounded in absolute value by $M_{n-1}^2$ so that the difference of the visits of the orbit $(T^i_{\alpha_n}(x))_{i=0}^{j_0 r_n M_{n-1} + s^{(j_0)}_{n-1}}$ to $L^n$ and $R^n$ is bounded in absolute value by some constant $c(M_1, \dots, M_{n-1})$. This finishes the proof of assertion (i).
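
The claim used in the proof of (i), namely $T^{r_n M_{n-1} + q_{n-1}}_{\alpha_n}(x) = x \oplus \frac{d_{n-1}}{M_n}$ with $d_{n-1} = q_{n-1} - e_{n-1}$, is again pure modular arithmetic. The sketch below checks it for sample values; it assumes only that $\alpha_{n-1} M_{n-1}$ is invertible modulo $M_{n-1}$ (which is what makes $q_{n-1}$ in (50) unique), and does not verify the further conditions on the $m_i$ imposed by Lemma 3.2. The name `claim_shift` is illustrative. It relies on `pow(a, -1, m)` for modular inverses, available from Python 3.8.

```python
def claim_shift(ms):
    """For ms = [m_1, ..., m_n] return (q, r, e, d, observed).

    q        : q_{n-1} from (50), i.e. q * alpha_{n-1} = -1/M_{n-1} modulo 1
    r, e     : m_n = r * M_{n-1} + e
    d        : q - e, the shift predicted by the claim (units of 1/M_n)
    observed : actual shift of T_{alpha_n}^(r*M_{n-1} + q), as a signed residue
    """
    M = [1]
    for m in ms:
        M.append(M[-1] * m)                       # M[i] = m_1 * ... * m_i
    n = len(ms)
    M_prev, M_n = M[n - 1], M[n]
    a_prev = sum(M_prev // M[i] for i in range(1, n))      # alpha_{n-1} in units of 1/M_{n-1}
    a_n = sum(M_n // M[i] for i in range(1, n + 1))        # alpha_n in units of 1/M_n
    q = (-pow(a_prev, -1, M_prev)) % M_prev
    r, e = divmod(ms[-1], M_prev)
    d = q - e
    observed = ((r * M_prev + q) * a_n) % M_n
    if observed > M_n // 2:
        observed -= M_n
    return q, r, e, d, observed
```

For `claim_shift([5, 11])` the predicted and observed shifts are both $3$, and for `claim_shift([5, 11, 101])` both are $-14$; in each case $|d_{n-1}| < M_{n-1}$.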

• Proof of (ii). Suppose first, as in the proof of Lemma 3.3-(ii), that $(k_1', \dots, k_{n-1}') = (k_1'', \dots, k_{n-1}'') =: (k_1, \dots, k_{n-1})$. For $x \in I_{k_1,\dots,k_{n-1},1}$ we have that each of the orbits $(T^{jM_{n-1}+i}_{\alpha_n}(x))_{i=0}^{M_{n-1}-1}$, for $j = 0, \dots, \lfloor \frac{m_n}{2M_{n-1}} \rfloor - 1$, visits $L^n$ one time more often than $R^n$. Hence

$$\varphi^n\Big(T^{\lfloor \frac{m_n}{2M_{n-1}} \rfloor M_{n-1}}_{\alpha_n}(x)\Big) - \varphi^n(x) = \Big\lfloor \frac{m_n}{2M_{n-1}} \Big\rfloor.$$

Noting that

$$T^{\lfloor \frac{m_n}{2M_{n-1}} \rfloor M_{n-1}}_{\alpha_n}(x) = x \oplus \Big\lfloor \frac{m_n}{2M_{n-1}} \Big\rfloor \frac{M_{n-1}}{M_n}$$

and

$$\frac{m_n+1}{2M_n} - \Big\lfloor \frac{m_n}{2M_{n-1}} \Big\rfloor \frac{M_{n-1}}{M_n} \le \frac{M_{n-1}}{M_n},$$

we obtain (ii) by using assertion (i), and possibly passing to a bigger constant $c(M_1, \dots, M_{n-1})$. Finally the passage to general $(k_1', \dots, k_{n-1}')$ and $(k_1'', \dots, k_{n-1}'')$ is done again, similarly as in the proof of Lemma 3.3, by repeated application of (i) and by passing once more to a bigger constant $c(M_1, \dots, M_{n-1})$.

• Proof of (iii). Fix $1 \le k_1', k_1'' \le M_1, \dots, 1 \le k_{n-1}', k_{n-1}'' \le m_{n-1}$ and $1 \le k_n', k_n'' \le m_n$ as above. Suppose, e.g., $k_n' \le M_{n-1}$ and $m_n - k_n'' \le M_{n-1}$, the other three cases being similar. Denote by $(k_1''', \dots, k_{n-1}''')$ the index so that $I_{k_1''',\dots,k_{n-1}'''} = I_{k_1'',\dots,k_{n-1}''} \oplus \frac{1}{M_{n-1}}$, i.e. $I_{k_1''',\dots,k_{n-1}'''}$ is the right neighbour of $I_{k_1'',\dots,k_{n-1}''}$. Now find $0 \le q_{n-1} < M_{n-1}$ such that $T^{q_{n-1}}_{\alpha_{n-1}}$ maps $I_{k_1',\dots,k_{n-1}'}$ onto $I_{k_1''',\dots,k_{n-1}'''}$. Hence $T^{q_{n-1}}_{\alpha_n}$ maps $I_{k_1',\dots,k_{n-1}',k_n'}$ onto $I_{k_1''',\dots,k_{n-1}''',\,k_n'+q_{n-1}}$.

Finally note that the distance from the latter interval to $I_{k_1'',\dots,k_{n-1}'',k_n''}$ is bounded by $(2M_{n-1} + M_{n-1})\frac{1}{M_n}$. Hence we obtain (48) by applying $2M_{n-1} + M_{n-1}$ times assertion (i) and using $0 \le q_{n-1} < M_{n-1}$. □

After this preparation we are ready for the inductive step from $n-1$ to $n$. Suppose that the following inductive hypotheses are satisfied, for $1 \le l \le n-1$, functions $\tau_l : [0,1) \to \{-M_l+1, \dots, M_l-1\}$ and index sets $J_l^g, J_l^s$ contained in $\{(k_1, \dots, k_l) : 1 \le k_1 \le m_1, \dots, 1 \le k_l \le m_l\}$.

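(i) The measure preserving bijection $T^{(\tau_{n-1})}_{\alpha_{n-1}} : [0,1) \to [0,1)$ maps the intervals $I_{k_1,\dots,k_l}$, for $1 \le l \le n-1$, and $1 \le k_1 \le m_1, \dots, 1 \le k_l \le m_l$, onto the intervals $T^{(\tau_l)}_{\alpha_l}(I_{k_1,\dots,k_l})$. It induces a permutation of the intervals $I_{k_1,\dots,k_{n-1}}$, where $1 \le k_1 \le m_1, \dots, 1 \le k_{n-1} \le m_{n-1}$.

(ii) When $\tau_{n-1}(x) > 0$, we have

$$T^i_{\alpha_{n-1}}(x) \notin I^{n-1}_{\mathrm{middle}}, \qquad i = 0, \dots, \tau_{n-1}(x), \tag{52}$$

and, when $\tau_{n-1}(x) < 0$, we have

$$T^i_{\alpha_{n-1}}(x) \notin I^{n-1}_{\mathrm{middle}}, \qquad i = \tau_{n-1}(x), \dots, 0. \tag{53}$$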

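(iii) There is a set of "good" indices $J^g_{n-1} \subseteq \{(k_1, \dots, k_{n-1}) : 1 \le k_1 \le m_1, \dots, 1 \le k_{n-1} \le m_{n-1}\}$. For $(k_1, \dots, k_{n-2}) \in J^g_{n-2}$ we have that $(k_1, \dots, k_{n-2}, k_{n-1}) \in J^g_{n-1}$ as well as

$$\mu\big[I_{k_1,\dots,k_{n-2}} \cap \{\tau_{n-2} \ne \tau_{n-1}\}\big] \le \frac{M_{n-2}}{m_{n-1}}\,\mu\big[I_{k_1,\dots,k_{n-2}}\big], \tag{54}$$

and

$$\sum_{(k_1,\dots,k_{n-2}) \in J^g_{n-2}} \int_{I_{k_1,\dots,k_{n-2}}} \Big|\big[\varphi^{n-2}(x) - \varphi^{n-2}\big(T^{(\tau_{n-2})}_{\alpha_{n-2}}(x)\big)\big] - \big[\varphi^{n-1}(x) - \varphi^{n-1}\big(T^{(\tau_{n-1})}_{\alpha_{n-1}}(x)\big)\big]\Big|\,dx \le \frac{c(M_1, \dots, M_{n-2})}{m_{n-1}}. \tag{55}$$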

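(iv) There is a set of "singular" indices $J^s_{n-1} \subseteq \{(k_1, \dots, k_{n-1}) : 1 \le k_1 \le m_1, \dots, 1 \le k_{n-1} \le m_{n-1}\}$, disjoint from $J^g_{n-1}$, such that $J^s_{n-1}$ consists of less than $2M_{n-1}^2$ many elements and such that

$$\varphi^{n-1}(x) - \varphi^{n-1}\big(T^{(\tau_{n-1})}_{\alpha_{n-1}}(x)\big) < 0, \qquad \text{for } x \in I_{k_1,\dots,k_{n-1}} \text{ and } (k_1, \dots, k_{n-1}) \in J^s_{n-1}, \tag{56}$$

and

$$\sum_{(k_1,\dots,k_{n-1}) \in J^s_{n-1}} \int_{I_{k_1,\dots,k_{n-1}}} \big[\varphi^{n-1}(x) - \varphi^{n-1}\big(T^{(\tau_{n-1})}_{\alpha_{n-1}}(x)\big)\big]\,dx \le -1 + \frac{3}{m_1} + \frac{c(M_1)}{m_2} + \dots + \frac{c(M_1, \dots, M_{n-2})}{m_{n-1}}, \tag{57}$$

where $c(\cdot)$ are constants depending only on $(\cdot)$.

(v) On the middle interval $I^{n-1}_{\mathrm{middle}} = [\frac12 - \frac{1}{2M_{n-1}}, \frac12 + \frac{1}{2M_{n-1}})$ we have $\tau_1 = \tau_2 = \dots = \tau_{n-1} = 0$, and $I^{n-1}_{\mathrm{middle}}$ together with the intervals $(I_{k_1,\dots,k_{n-1}})_{(k_1,\dots,k_{n-1}) \in J^s_{n-1} \cup J^g_{n-1}}$ form a partition of $[0,1)$.

We have to define $\tau_n$ as well as $J_n^g$ and $J_n^s$ so that the above list is satisfied with $n-1$ replaced by $n$.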

Let us illustrate graphically some features of this construction, namely the fractal structure of the singular set and the resulting quasi-cost.

Fig. 6. The fractal structure of the "singular" set (levels $n = 1, 2, 3, 4$).

For the sake of simplicity of the drawing, the red area which represents the singular set is thicker than it should be. Note also that the effective singular set is not perfectly balanced.

Fig. 7. Shape of the quasi-cost $\varphi^n + \psi^n \circ T^{(\tau_n)}_{\alpha_n}$.

The strips on this graphic representation symbolize the oscillations of the function $\varphi^n + \psi^n \circ T^{(\tau_n)}_{\alpha_n}$. On the "singular" set, this function achieves values of order $-M_n/M_{n-1}^2$. Of course, the effective singular set is much more fragmented than it appears on this figure.

We start with a "good" interval $I_{k_1,\dots,k_{n-1}}$, i.e. $(k_1, \dots, k_{n-1}) \in J^g_{n-1}$, and simply write $\tau$ for $\tau_{n-1}|_{I_{k_1,\dots,k_{n-1}}}$. If $\tau > 0$, define $J^{k_1,\dots,k_{n-1},c}$, where $c$ stands for "change", as $\{m_n - \tau + 1, \dots, m_n\}$. This set consists of those indices $k_n$ such that the interval $I_{k_1,\dots,k_n}$ is not mapped into $T^{(\tau_{n-1})}_{\alpha_{n-1}}(I_{k_1,\dots,k_{n-1}})$ under $T^{(\tau_{n-1})}_{\alpha_n}$. If $\tau < 0$, we define $J^{k_1,\dots,k_{n-1},c}$ as $\{1, \dots, -\tau\}$.

The complement $\{1, \dots, m_n\} \setminus J^{k_1,\dots,k_{n-1},c}$ is denoted by $J^{k_1,\dots,k_{n-1},u}$, where $u$ stands for "unchanged".

Define $\tau_n := \tau_{n-1} = \tau$ on the intervals $I_{k_1,\dots,k_{n-1},k_n}$, for $k_n \in J^{k_1,\dots,k_{n-1},u}$. For $x$ in one of those intervals we have by (52), (53) and (47) that

$$\varphi^n(x) - \varphi^n\big(T^{(\tau_n)}_{\alpha_n}(x)\big) = \varphi^{n-1}(x) - \varphi^{n-1}\big(T^{(\tau_{n-1})}_{\alpha_{n-1}}(x)\big),$$

which yields (54) with $n-1$ replaced by $n$.

On the remaining intervals $I_{k_1,\dots,k_n}$ with $k_n \in J^{k_1,\dots,k_{n-1},c}$ we define $\tau_n$ such that it takes constant values in $\{-M_n+1, \dots, M_n-1\}$ on each of these intervals, such that (52) (resp. (53)) is satisfied, and such that these intervals $I_{k_1,\dots,k_n}$ are mapped onto the "remaining gaps" in $T^{(\tau_{n-1})}_{\alpha_{n-1}}(I_{k_1,\dots,k_{n-1}})$.

The crucial observation is that the intervals $I_{k_1,\dots,k_{n-1},k_n}$ where we have $\tau_n \ne \tau_{n-1}$, i.e. where $k_n \in J^{k_1,\dots,k_{n-1},c}$, are all on the "boundary" of $I_{k_1,\dots,k_{n-1}}$: they are the $|\tau|$ many intervals on the left or right end of $I_{k_1,\dots,k_{n-1}}$, depending on the sign of $\tau$. Similarly, the "remaining gaps" in $T^{(\tau_{n-1})}_{\alpha_{n-1}}(I_{k_1,\dots,k_{n-1}})$ are the $|\tau|$ many intervals on the opposite end of $T^{(\tau_{n-1})}_{\alpha_{n-1}}(I_{k_1,\dots,k_{n-1}})$. Hence we may apply assertion (iii) of Lemma 3.4 to conclude that

$$\big|\varphi^n(x) - \varphi^n\big(T^{(\tau_n)}_{\alpha_n}(x)\big)\big| \le c(M_1, \dots, M_{n-1}),$$

for those $x \in I_{k_1,\dots,k_{n-1}}$ where $\tau_n(x) \ne \tau_{n-1}(x)$. Summing over all "good intervals" $I_{k_1,\dots,k_{n-1}}$, where $(k_1, \dots, k_{n-1}) \in J^g_{n-1}$, we conclude that the contribution to (55), with $n-1$ replaced by $n$, is controlled by the following factors: $M_{n-1}$, which is a bound for the number of elements in $J^g_{n-1}$, times $M_{n-1}$, which is a bound for $|\tau|$, times $\frac{1}{M_n}$, which is the length of the intervals $I_{k_1,\dots,k_n}$, times the above found constant $c(M_1, \dots, M_{n-1})$. In total, this implies the estimate (55), with $n-1$ replaced by $n$.

We now turn to item (iv), i.e. to the "singular" indices: fix $(k_1, \dots, k_{n-1}) \in J^s_{n-1}$ and let $\Delta\varphi$ denote the constant

$$\Delta\varphi := \varphi^{n-1}\big(T^{(\tau_{n-1})}_{\alpha_{n-1}}(x)\big) - \varphi^{n-1}(x), \qquad x \in I_{k_1,\dots,k_{n-1}},$$

and again $\tau$ the constant $\tau_{n-1}|_{I_{k_1,\dots,k_{n-1}}}$, so that $0 < \Delta\varphi \le |\tau| < M_{n-1}$.
Similarly as for the case $n = 2$ define

$$J^{k_1,\dots,k_{n-1},g,l} = \Big\{k_n^l, k_n^l + 1, \dots, \tfrac{m_n-1}{2}\Big\}, \qquad J^{k_1,\dots,k_{n-1},g,r} = \Big\{\tfrac{m_n+1}{2}, \dots, k_n^r\Big\}.$$

Here $k_n^r$ is the largest number such that, for the orbit $(T^i_{\alpha_n}(x))_{i=0}^{\tau + \Delta\varphi M_{n-1} - 1}$ and for $x \in I_{k_1,\dots,k_{n-1},k_n^r}$, all its members lie in the right half of the respective intervals $I_{k_1',\dots,k_{n-1}'}$. In fact, we get as in the step $n = 2$ that $k_n^r = m_n - (\tau + \Delta\varphi M_{n-1})$.

Similarly $k_n^l$ is the smallest number such that, for the orbit $(T^i_{\alpha_n}(x))_{i=\tau - \Delta\varphi M_{n-1} + 1}^{0}$ and for $x \in I_{k_1,\dots,k_{n-1},k_n^l}$, all its members are in the left half of the respective intervals $I_{k_1',\dots,k_{n-1}'}$. As in the step $n = 2$, where $k_2^l = \frac{(M_1-3)(M_1-1)}{2} + 1$, we get $k_n^l = \Delta\varphi M_{n-1} - \tau + 1$.
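Now we define $\tau_n$ as

$$\tau_n(x) = \tau + \Delta\varphi M_{n-1}, \qquad \text{for } x \in I_{k_1,\dots,k_{n-1},k_n},\ k_n \in J^{k_1,\dots,k_{n-1},g,r},$$

and

$$\tau_n(x) = \tau - \Delta\varphi M_{n-1}, \qquad \text{for } x \in I_{k_1,\dots,k_{n-1},k_n},\ k_n \in J^{k_1,\dots,k_{n-1},g,l}.$$

Similarly as in (44) at step $n = 2$, we get for $k_n \in J^{k_1,\dots,k_{n-1},g} := J^{k_1,\dots,k_{n-1},g,l} \cup J^{k_1,\dots,k_{n-1},g,r}$ and $x \in I_{k_1,\dots,k_{n-1},k_n}$ that

$$\begin{aligned} \varphi^n(x) - \varphi^n\big(T^{(\tau_n)}_{\alpha_n}(x)\big) &= \big[\varphi^n(x) - \varphi^n\big(T^{(\tau_{n-1})}_{\alpha_n}(x)\big)\big] + \big[\varphi^n\big(T^{(\tau_{n-1})}_{\alpha_n}(x)\big) - \varphi^n\big(T^{(\tau_n)}_{\alpha_n}(x)\big)\big] \\ &= \big[\varphi^{n-1}(x) - \varphi^{n-1}\big(T^{(\tau_{n-1})}_{\alpha_{n-1}}(x)\big)\big] + \big[\varphi^n\big(T^{(\tau_{n-1})}_{\alpha_n}(x)\big) - \varphi^n\big(T^{(\tau_n)}_{\alpha_n}(x)\big)\big] \\ &= -\Delta\varphi + \Delta\varphi = 0. \end{aligned}$$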

We still have to deal with the "singular" indices

$$J^{k_1,\dots,k_{n-1},s} := \{1, \dots, m_n\} \setminus J^{k_1,\dots,k_{n-1},g} = \{1, \dots, k_n^l - 1\} \cup \{k_n^r + 1, \dots, m_n\},$$

which consists of $2\Delta\varphi M_{n-1}$ many indices. This number is bounded by $2M_{n-1}^2$ as $\Delta\varphi \le |\tau| < M_{n-1}$. These intervals have to be mapped onto the "remaining gaps" in the interval $T^{(\tau_{n-1})}_{\alpha_{n-1}}(I_{k_1,\dots,k_{n-1}})$. Make the crucial observation that, while the intervals $I_{k_1,\dots,k_{n-1},k_n}$, for $k_n \in J^{k_1,\dots,k_{n-1},s}$, are at the boundary of $I_{k_1,\dots,k_{n-1}}$, the "remaining gaps" are in the middle of the interval $T^{(\tau_{n-1})}_{\alpha_{n-1}}(I_{k_1,\dots,k_{n-1}})$. This fact is analogous to the situation for $n = 1$ and $n = 2$.

Now define $\tau_n$ on the intervals $I_{k_1,\dots,k_{n-1},k_n}$ for $k_n \in J^{k_1,\dots,k_{n-1},s}$ in such a way that $T^{(\tau_n)}_{\alpha_n}$ maps these intervals onto the "remaining gaps" in $T^{(\tau_{n-1})}_{\alpha_{n-1}}(I_{k_1,\dots,k_{n-1}})$ and such that $\tau_n$ is constant on each of these intervals, takes values in $\{-M_n+1, \dots, M_n-1\}$ and such that (52) (resp. (53)) is satisfied with $n-1$ replaced by $n$. Applying Lemma 3.4, assertion (ii), as well as $2(M_{n-1}+1)|\tau|$ many times assertion (i) we obtain, for $x \in I_{k_1,\dots,k_{n-1},k_n}$ and $k_n \in J^{k_1,\dots,k_{n-1},s}$,

$$\varphi^n(x) - \varphi^n\big(T^{(\tau_n)}_{\alpha_n}(x)\big) \le -\frac{m_n}{2M_{n-1}} + c(M_1, \dots, M_{n-1}).$$

Assuming that $m_n$ is sufficiently large as compared to $M_{n-1}$ we have that the right hand side is negative.

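Keeping in mind that there are $2\Delta\varphi M_{n-1}$ many indices in $J^{k_1,\dots,k_{n-1},s}$, we may estimate the "singular mass" on the interval $I_{k_1,\dots,k_{n-1}}$ by

$$\sum_{k_n \in J^{k_1,\dots,k_{n-1},s}} \int_{I_{k_1,\dots,k_n}} \big[\varphi^n(x) - \varphi^n\big(T^{(\tau_n)}_{\alpha_n}(x)\big)\big]\,dx \le 2\Delta\varphi M_{n-1}\Big[-\frac{m_n}{2M_{n-1}} + c(M_1, \dots, M_{n-1})\Big]\frac{1}{M_n} \le -\frac{\Delta\varphi}{M_{n-1}} + \frac{c(M_1, \dots, M_{n-1})}{m_n}. \tag{58}$$

We have by the inductive hypothesis that

$$\sum_{(k_1,\dots,k_{n-1}) \in J^s_{n-1}} \int_{I_{k_1,\dots,k_{n-1}}} \big[\varphi^{n-1}(x) - \varphi^{n-1}\big(T^{(\tau_{n-1})}_{\alpha_{n-1}}(x)\big)\big]\,dx \le -1 + \frac{3}{m_1} + \frac{c(M_1)}{m_2} + \dots + \frac{c(M_1, \dots, M_{n-2})}{m_{n-1}},$$

or, writing now $\Delta\varphi_{k_1,\dots,k_{n-1}}$ for the above value of $\Delta\varphi$ on the interval $I_{k_1,\dots,k_{n-1}}$,

$$-\frac{1}{M_{n-1}} \sum_{(k_1,\dots,k_{n-1}) \in J^s_{n-1}} \Delta\varphi_{k_1,\dots,k_{n-1}} \le -1 + \frac{3}{m_1} + \frac{c(M_1)}{m_2} + \dots + \frac{c(M_1, \dots, M_{n-2})}{m_{n-1}}.$$

Letting $J_n^s := \bigcup_{(k_1,\dots,k_{n-1}) \in J^s_{n-1}} \{(k_1, \dots, k_{n-1}, k_n) : k_n \in J^{k_1,\dots,k_{n-1},s}\}$, we obtain from (58)

$$\begin{aligned} \sum_{(k_1,\dots,k_n) \in J_n^s} \int_{I_{k_1,\dots,k_n}} \big[\varphi^n(x) - \varphi^n\big(T^{(\tau_n)}_{\alpha_n}(x)\big)\big]\,dx &\le \sum_{(k_1,\dots,k_{n-1}) \in J^s_{n-1}} \Big[-\frac{\Delta\varphi_{k_1,\dots,k_{n-1}}}{M_{n-1}} + \frac{c(M_1, \dots, M_{n-1})}{m_n}\Big] \\ &\le -1 + \frac{3}{m_1} + \frac{c(M_1)}{m_2} + \dots + \frac{c(M_1, \dots, M_{n-2})}{m_{n-1}} + \frac{c(M_1, \dots, M_{n-1})}{m_n}, \end{aligned}$$

where we may have increased the constant $c(M_1, \dots, M_{n-1})$ in the last line. This concludes the inductive step.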



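Construction of the Example: Let $\alpha = \lim_{n\to\infty} \alpha_n$ so that $T_\alpha = \lim_{n\to\infty} T_{\alpha_n}$ is the shift by the irrational number $\alpha$.

The sequence $(\tau_n)_{n=1}^\infty$ of functions $\tau_n : [0,1) \to \mathbb{Z}$ converges, by (54), almost surely to a $\mathbb{Z}$-valued function $\tau = \lim_{n\to\infty} \tau_n$. Hence the maps $(T^{(\tau_n)}_{\alpha_n})_{n=1}^\infty$ converge almost surely to a map

$$T^{(\tau)}_\alpha : [0,1) \to [0,1), \qquad x \mapsto T^{(\tau)}_\alpha(x) = T^{\tau(x)}_\alpha(x).$$

Using the fact that each $T^{(\tau_n)}_{\alpha_n}$ is a measure preserving almost sure bijection on $[0,1)$, it is straightforward to check that $T^{(\tau)}_\alpha$ is so too.

Letting $\Gamma_\tau = \{(x, T^{(\tau)}_\alpha(x)),\ x \in [0,1)\}$ in analogy to the notations $\Gamma_0 = \{(x,x),\ x \in [0,1)\}$ and $\Gamma_1 = \{(x, T_\alpha(x)),\ x \in [0,1)\}$, we define

$$c(x,y) = \begin{cases} h_+(x,y), & \text{if } (x,y) \in \Gamma_0 \cup \Gamma_1 \cup \Gamma_\tau, \\ \infty, & \text{otherwise}, \end{cases}$$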

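where $h$ is defined in (15) above. From this definition we deduce the almost sure identity, for $\tau(x) > 0$,

$$\begin{aligned} h\big(x, T^{(\tau)}_\alpha(x)\big) &= \#\{i \in \{0, \dots, \tau(x)-1\} : T^i_\alpha(x) \in [0, \tfrac12)\} - \#\{i \in \{0, \dots, \tau(x)-1\} : T^i_\alpha(x) \in [\tfrac12, 1)\} + 1 \\ &= \lim_{n\to\infty} \big[\varphi^n(x) - \varphi^n\big(T^{(\tau_n)}_{\alpha_n}(x)\big)\big] + 1, \end{aligned} \tag{59}$$

a similar formula holding true for $\tau(x) < 0$.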

As regards the Borel functions ((p n ,ipn)%Li announced in (17), (18) and (19) above, we 
need to slightly modify the functions (<£™, ^™)$£ =1 constructed in the above induction to 
make sure that they satisfy the inequality 

<p n (x) + ip n (y) < c(x, y), for x G X, y G F. (60) 

As c = oo outside of r U Fi U F T it is sufficient to make sure that the following inequalities 
hold true almost surely, for x G [0, 1) : 

(0) <p n (x) + i> n (x) < C(X,X) = 1, 

'2, for x G [0,i), 



(1) <p n (x)+^ n (T a {x)) < c(x,T a (x)) = j 
(r) + < c(x,T^)(x)). 



0, for x G [±,1) 



28 



The above constructed $(\varphi^n, \psi^n)_{n=1}^\infty$ only satisfy condition (0). We still have to pass from $\varphi^n$
to a smaller function $\varphi_n$ - while leaving $\psi_n := \psi^n$ unchanged - to satisfy (1) and ($\tau$) too.
Let

$$\varphi_n(x) := \varphi^n(x) - [\varphi^n(x) + \psi^n(T_\alpha(x)) - c(x, T_\alpha(x))]_+ - [\varphi^n(x) + \psi^n(T_\alpha^{(\tau)}(x)) - c(x, T_\alpha^{(\tau)}(x))]_+. \tag{61}$$

Clearly $\varphi_n \le \varphi^n$ and the functions $(\varphi_n, \psi_n)$ satisfy the inequality (60).
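The effect of subtracting the positive parts of the constraint violations, as in (61), can be checked on a finite toy model. This is a minimal sketch, not the construction of the paper: the points are indices `0..n-1`, the lists `T1` and `T2` are hypothetical stand-ins for the two maps, and `c1`, `c2` are hypothetical costs along their graphs.

```python
def pos(v):
    """Positive part [v]_+."""
    return v if v > 0 else 0

def corrected_phi(phi, psi, constraints):
    """Subtract from phi the positive part of each constraint violation.

    constraints is a list of pairs (T, c): T[x] is the image of point x and
    c[x] is the cost of the pair (x, T[x]). All names are illustrative.
    """
    out = []
    for x in range(len(phi)):
        excess = sum(pos(phi[x] + psi[T[x]] - c[x]) for T, c in constraints)
        out.append(phi[x] - excess)
    return out

phi = [3, 0, 1]
psi = [0, 2, -1]
T1, c1 = [1, 2, 0], [2, 0, 2]
T2, c2 = [2, 0, 1], [1, 1, 1]
new_phi = corrected_phi(phi, psi, [(T1, c1), (T2, c2)])
print(new_phi)
```

Each subtracted term alone already removes the violation of its own constraint, and subtracting further nonnegative terms can only help, which is why the corrected function satisfies all constraints at once.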

We have to show that the functions $\varphi_n$ defined in (61) satisfy that $\varphi^n - \varphi_n$ is small in the
norm of $L^1(\mu)$, as $n \to \infty$, that is

$$\lim_{n\to\infty} \int_{[0,1)} (\varphi^n(x) - \varphi_n(x))\, dx = 0, \tag{62}$$

provided that $(m_n)_{n=1}^\infty$ increases sufficiently fast to infinity.
We may estimate the first correction term in (61) by

$$[\varphi^n(x) + \psi^n(T_\alpha(x)) - c(x, T_\alpha(x))]_+ \le [\psi^n(T_\alpha(x)) - \psi^n(T_{\alpha_n}(x))]_+ + [\varphi^n(x) + \psi^n(T_{\alpha_n}(x)) - c(x, T_\alpha(x))]_+.$$

The second term above is dominated by $1_{I^n_{\rm middle}}$ which is harmless as $\|1_{I^n_{\rm middle}}\|_{L^1(\mu)} = \frac{1}{M_n}$.
As regards the first term, note that T a (x) T an (x) = a — a n = ^2jL n +i ~M~ which we may 
bound by M 2 +1 by assuming that (m n )%L 1 increases sufficiently fast to infinity. As tp n is 
constant on each of the M n many intervals Ifci,...,fc„ we get 

fi{xe [0,1) :^ n {T a (x))^^ n (T an (x)} < M n {a-a n )<^. 

On this set we may estimate, using only the obvious bound $|\psi^n(x)| \le M_n$, that

$$|\psi^n(T_\alpha(x)) - \psi^n(T_{\alpha_n}(x))| \le 2M_n, \quad x \in [0,1),$$

to obtain 

\\r(T a (x)) -r{T an {x))\\ L , w <^. 

Hence for $(m_n)_{n=1}^\infty$ growing sufficiently fast to infinity, the first correction term in (61) is
also small in $L^1$-norm.

To estimate the second correction term in (61) note that

$$\varphi^n(x) + \psi^n(T_\alpha^{(\tau)}(x)) = \varphi^n(x) + \psi^n(T_{\alpha_n}^{(\tau_n)}(x)), \quad \text{for } x \in [0,1). \tag{63}$$

Indeed, $T_{\alpha_n}^{(\tau_n)}$ induces a permutation between the intervals $I_{k_1,\ldots,k_n}$ and, by assertion (i) preceding
the formula (52), we have that $T_{\alpha_j}^{(\tau_j)}$ maps the intervals $I_{k_1,\ldots,k_n}$ onto the intervals
$T_{\alpha_n}^{(\tau_n)}(I_{k_1,\ldots,k_n})$, for each $j \ge n$. Noting that $\psi^n$ is constant on each of the intervals $I_{k_1,\ldots,k_n}$
we obtain (63), by letting $j$ tend to infinity.

By (47), $\varphi^n(x) + \psi^n(T_{\alpha_n}^{(\tau_n)}(x))$ is the number of visits to $L_n$ minus the number of visits to
$R_n$ plus one, of the orbit $(T_{\alpha_n}^i(x))_{i=0}^{\tau_n(x)-1}$. Similarly, by (15), $h(x, T_\alpha^{(\tau)}(x))$ is the number of
visits to $L$ minus the number of visits to $R$ plus one, of the orbit $(T_\alpha^i(x))_{i=0}^{\tau(x)-1}$. We have to
show that the positive part of the difference

$$f_n(x) := [\varphi^n(x) + \psi^n(T_{\alpha_n}^{(\tau_n)}(x)) - h_+(x, T_\alpha^{(\tau)}(x))]_+, \quad x \in [0,1), \tag{64}$$

is small in $L^1$-norm, as $n \to \infty$. To do so, we argue separately on $I^n_{\rm middle} = [\tfrac12 - \tfrac{1}{2M_n}, \tfrac12 + \tfrac{1}{2M_n})$,
on the union of the "good" intervals at level $n$: $G_n = \bigcup_{(k_1,\ldots,k_n) \in J^g} I_{k_1,\ldots,k_n}$, and the union
of the "singular" intervals at level $n$, $S_n = \bigcup_{(k_1,\ldots,k_n) \in J^s} I_{k_1,\ldots,k_n}$.
- For $x \in I^n_{\rm middle}$ the correction term $f_n(x)$ in (64) simply equals zero as $\tau_n(x) = \tau(x) = 0$.

- For $x \in S_n$, we have by (56) that $\varphi^n(x) + \psi^n(T_{\alpha_n}^{(\tau_n)}(x)) \le 1$ so that $f_n(x) \le 1$ too;
hence $\lim_{n\to\infty} \|f_n 1_{S_n}\|_{L^1(\mu)} = 0$.

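- For $x \in G_n$, we use

$$f_n(x) \le [\varphi^n(x) + \psi^n(T_{\alpha_n}^{(\tau_n)}(x)) - h(x, T_\alpha^{(\tau)}(x))]_+ \le \sum_{k=n+1}^\infty \big[ (\varphi^{k-1}(x) + \psi^{k-1}(T_{\alpha_{k-1}}^{(\tau_{k-1})}(x))) - (\varphi^k(x) + \psi^k(T_{\alpha_k}^{(\tau_k)}(x))) \big]_+$$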

and (55) to conclude that 

oo 
k=n+l 

This proves (62). 
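The estimate on the "good" intervals rests on a telescoping sum together with the subadditivity of the positive part: the positive part of a total decrease is at most the sum of the positive parts of the individual decrements. A minimal sketch of this elementary inequality for a finite sequence, where the function name `telescoping_bound` and the sample values are purely illustrative:

```python
def pos(v):
    """Positive part [v]_+."""
    return v if v > 0 else 0

def telescoping_bound(values):
    """Return ([a_0 - a_last]_+, sum over k of [a_(k-1) - a_k]_+)."""
    lhs = pos(values[0] - values[-1])
    rhs = sum(pos(values[k - 1] - values[k]) for k in range(1, len(values)))
    return lhs, rhs

# Illustrative sequence: a_0 is the level-n quantity, the last entry the limit.
lhs, rhs = telescoping_bound([5, 3, 4, 1, 2])
print(lhs, rhs)
```

In the text the sequence is infinite and the role of the last entry is played by the almost sure limit, so the finite sum becomes the series over $k \ge n+1$.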

Hence (17), (18) and (19) are satisfied. 

As regards assertion (16), let us verify that $\pi_0$ and $\pi_1$ are optimal transport plans.
Indeed, it follows from (17) and (18) that the dual value of the present transport problem
is greater than or equal to one which implies that $\langle c, \pi_0 \rangle = \langle c, \pi_1 \rangle = 1$ is the optimal primal
value.

The fact that $\langle c, \pi_\tau \rangle > 1$ should be rather obvious to a reader who has made it up to
this point of the construction. It follows from rough estimates. The set {[0, |) (~l {r = 

— 1}} U {[^, 1) fl {r = 1}} has measure bigger than 1 — + c ^ Ml, ' m ' M '~ 1 ^ , which is 

bigger than, say, |, for (m n )^ 1 tending sufficiently quick to infinity. As c(x, T&\x)) equals 
2 on this set we get 

(C,7T T ) > | > 1. 

A slightly more involved argument, whose verification is left to the energetic reader, shows
that, for $\varepsilon > 0$, we may choose $(m_n)_{n=1}^\infty$ such that

$$\langle h, \pi_\tau \rangle > 2 - \varepsilon. \tag{65}$$

Finally, we show assertion (iv) at the beginning of this section (see (20)). Let $\hat h \in L^1(\pi)^{**}$
be a dual optimizer in the sense of [BLS09, Theorem 4.2]. We know from this theorem that
there is a sequence $(\varphi_n, \psi_n)_{n=1}^\infty$ of bounded Borel functions$^5$ such that

($\alpha$) $\lim_{n\to\infty} \|[\varphi_n \oplus \psi_n - c]_+\|_{L^1(\pi)} = 0$, (66)

($\beta$) $\lim_{n\to\infty} \big( \int_X \varphi_n(x)\, d\mu(x) + \int_Y \psi_n(y)\, d\nu(y) \big) = 1$, (67)

($\gamma$) $\lim_{n\to\infty} \varphi_n \oplus \psi_n = \hat h^r$, $\pi$-a.s., (68)

($\delta$) $\hat h$ is a $\sigma(L^1(\pi)^{**}, L^\infty(\pi))$ cluster point of $(\varphi_n \oplus \psi_n)_{n=1}^\infty$. (69)

Here $\hat h = \hat h^r + \hat h^s$ is the decomposition of $\hat h \in L^1(\pi)^{**}$ into its regular part $\hat h^r \in L^1(\pi)$ and
into its purely singular part $\hat h^s \in L^1(\pi)^{**}$.



$^5$ The $(\varphi_n, \psi_n)$ need not be the same as the special sequence constructed above; still we find it convenient
to use the same notation.
We shall show that $\hat h^r$ equals $h$, $\pi$-almost surely. Indeed by assertions (66) and (67) above
we have that, for $x \in [0,1)$,

$$\lim_{n\to\infty} (\varphi_n(x) + \psi_n(x)) = c(x,x) = h(x,x) = 1,$$

and

$$\lim_{n\to\infty} (\varphi_n(x) + \psi_n(T_\alpha(x))) = c(x, T_\alpha(x)) = h(x, T_\alpha(x)) = \begin{cases} 2, & \text{for } x \in [0,\tfrac12), \\ 0, & \text{for } x \in [\tfrac12,1), \end{cases}$$

the limit holding true in $L^1([0,1), \mu)$ as well as for $\mu$-a.e. $x \in [0,1)$, possibly after passing to
a subsequence. As in the discussion following [BLS09, Theorem 4.2] this implies that, for
each fixed $i \in \mathbb{Z}$,

$$\lim_{n\to\infty} (\varphi_n(x) + \psi_n(T_\alpha^i(x))) = h(x, T_\alpha^i(x)),$$

the limit again holding true in $L^1(\mu)$ and $\mu$-a.s., after possibly passing to a diagonal subsequence.
Whence, we obtain with (68) that

$$\lim_{n\to\infty} (\varphi_n(x) + \psi_n(T_\alpha^{(\tau)}(x))) = h(x, T_\alpha^{(\tau)}(x)) = \hat h^r(x, T_\alpha^{(\tau)}(x)),$$

convergence now holding true for $\mu$-a.e. $x \in [0,1)$.

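As $x \mapsto T_\alpha^{(\tau)}(x)$ is a measure preserving bijection we get

$$\int_{[0,1)} [\varphi_n(x) + \psi_n(T_\alpha^{(\tau)}(x))]\, dx = \int_{[0,1)} (\varphi_n(x) + \psi_n(x))\, dx = 1,$$

so that, using (65) we get

$$\begin{aligned} &\lim_{n\to\infty} \int_{[0,1)} [\varphi_n(x) + \psi_n(T_\alpha^{(\tau)}(x))]\, 1_{\{\varphi_n(x) + \psi_n(T_\alpha^{(\tau)}(x)) < h(x, T_\alpha^{(\tau)}(x))\}}(x)\, dx \\ &\quad = 1 - \lim_{n\to\infty} \int_{[0,1)} [\varphi_n(x) + \psi_n(T_\alpha^{(\tau)}(x))]\, 1_{\{\varphi_n(x) + \psi_n(T_\alpha^{(\tau)}(x)) \ge h(x, T_\alpha^{(\tau)}(x))\}}(x)\, dx \\ &\quad = 1 - \langle h, \pi_\tau \rangle \\ &\quad < 0. \end{aligned}$$

From $\lim_{n\to\infty} \mu\{x : \varphi_n(x) + \psi_n(T_\alpha^{(\tau)}(x)) < h(x, T_\alpha^{(\tau)}(x))\} = 0$ we conclude that each $\sigma^*$-cluster
point of $([\varphi_n(\cdot) + \psi_n(T_\alpha^{(\tau)}(\cdot))]_-)_{n=1}^\infty$ is a purely singular element of $L^1(\mu)^{**}$ of norm
equal to $\langle h, \pi_\tau \rangle - 1$.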

Finally, we still have to specify the prime numbers $(m_n)_{n=1}^\infty$ in the above induction.
It is now clear what we need: apart from satisfying the conditions of Lemma 3.1 as well
as the requirements whenever we wrote "for $m_n$ tending sufficiently fast to infinity", we
choose the (m n )%L 1 inductively such that in (54) we have ^1 < 2-™, that in (55) wc have 

C (M 1 ,...,M„_ 2 ) 2 _„ and in (5?) wc hayc 1 n in c(M 1 ,...,M„- 2 ) 2 _„ 

m n _i v ' mi 4 ° rn n -i 

Hence we have shown all the assertions (i)-(iv) of Example 3.1 and the construction of 
the example is complete. □ 

4 A Relaxation of the Dual Problem 

As in [BLS09, Remark 3.4], for a given cost function $c : X \times Y \to [0,\infty]$, we consider the
family of pairs of functions

$$\Psi^{\rm rel}(\mu,\nu) = \left\{ (\varphi,\psi) : \begin{array}{l} \varphi, \psi \text{ Borel, integrable and } \varphi(x) + \psi(y) \le c(x,y),\ \pi\text{-a.s.}, \\ \text{for each finite transport plan } \pi \in \Pi(\mu,\nu,c) \end{array} \right\}$$

and define the relaxed value of the dual problem as

$$D^{\rm rel} = \sup\left\{ \int \varphi\, d\mu + \int \psi\, d\nu : (\varphi,\psi) \in \Psi^{\rm rel}(\mu,\nu) \right\}. \tag{70}$$

Using the notation of [BLS09] it is obvious that $D \le D^{\rm rel}$ and it is straightforward to verify
that the trivial duality inequality $D^{\rm rel} \le P$ still is satisfied. One might conjecture - and the
present authors did so for some time - that $D^{\rm rel} = P$ holds true in full generality, i.e. for
arbitrary Borel measurable cost functions $c : X \times Y \to [0,\infty]$, defined on the product of two
Polish spaces $X$ and $Y$. In this section we construct a counterexample showing that this is
not the case, i.e. it may happen that we have a duality gap $P - D^{\rm rel} > 0$. The example will
be a variant of the example in the previous section, i.e. the $(n+1)$'th variation of [AP03,
Example 3.2].

In section 3 we constructed a measure preserving bijection $T_\alpha^{(\tau)} : [0,1) \to [0,1)$ having
certain properties; we now shall construct a sequence $(T_\alpha^{(\tau_n)})_{n=0}^\infty$ of such maps and consider
as cost function the restriction of $h_+$, where $h$ is defined in (15), to the graphs $(\Gamma_n)_{n=0}^\infty$ of
the maps $(T_\alpha^{(\tau_n)})_{n=0}^\infty$. This sequence also "builds up a singular mass", which now is positive
as opposed to the negative singular mass in the previous section, but it does so in a different
way. We summarize the properties of these maps, which we shall construct, in the following
proposition.

Proposition 4.1. With the notation of section 3 there is an irrational $\alpha \in [0,1)$ and a
sequence $(\tau_n)_{n=0}^\infty$ of maps $\tau_n : [0,1) \to \mathbb{Z}$, with $\tau_0 = 0$ and $\tau_1 = 1$, such that the transformations
$T_\alpha^{(\tau_n)} : [0,1) \to [0,1)$, defined by

$$T_\alpha^{(\tau_n)}(x) = T_\alpha^{\tau_n(x)}(x), \quad x \in [0,1),$$

have the following properties.

(i) Each $\tau_n$ is constant on a countable collection of disjoint, half open intervals in $[0,1)$
whose union has full measure. For $n \ge 0$, the map $T_\alpha^{(\tau_n)}$ defines a measure preserving
almost sure bijection of $([0,1), \mu)$ onto itself, where $\mu = \nu$ denotes Lebesgue measure
on $[0,1)$. We have, for each $n \ge 0$,

$$\int_{[0,1)} h(x, T_\alpha^{(\tau_n)}(x))\, dx = 1. \tag{71}$$

(ii) The function

$$f_n(x) := h(x, T_\alpha^{(\tau_n)}(x)), \quad x \in [0,1),$$

where $h$ is defined in (15), satisfies

$$\|f_n - g_n\|_{L^1(\mu)} < \eta_n \tag{72}$$

where $g_n$ is a Borel function on $[0,1)$ such that

$$\mu\{g_n = 0\} = 1 - \eta_n, \qquad \mu\{g_n = \tfrac{1}{\eta_n}\} = \eta_n \tag{73}$$

for some sequence $(\eta_n)_{n=1}^\infty$ tending to zero.

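(iii) There is a sequence $(\varphi_n, \psi_n)_{n=1}^\infty$ of bounded Borel functions such that, for every fixed
$n \in \mathbb{N}$,

$$\lim_{m\to\infty} \big\| h(x, T_\alpha^{(\tau_n)}(x)) - [\varphi_m(x) + \psi_m(T_\alpha^{(\tau_n)}(x))] \big\|_{L^1(\mu)} = 0,$$

and

$$\lim_{n\to\infty} \left[ \int_{[0,1)} \varphi_n(x)\, dx + \int_{[0,1)} \psi_n(y)\, dy \right] = 1.$$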
(iv) The sequence $(T_\alpha^{(\tau_n)})_{n=1}^\infty$ converges to the identity map in the following sense:

$$\delta(x, T_\alpha^{(\tau_n)}(x)) < 2^{-n}, \quad x \in [0,1),\ n \ge 1, \tag{74}$$

where $\delta(\cdot,\cdot)$ denotes the Riemannian metric on $\mathbb{T} = [0,1)$.
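For readers who want to experiment with condition (74), the metric on the circle $[0,1)$ is easy to compute: it is the shorter of the two arcs between the points. The sketch below is an illustration only; the function names `circle_distance` and `within_level` are hypothetical, and exact fractions are used so that the comparison with $2^{-n}$ is not affected by rounding.

```python
from fractions import Fraction

def circle_distance(x, y):
    """Distance between x and y on the circle [0, 1), with 0 and 1 identified."""
    d = abs(x - y) % 1
    return min(d, 1 - d)

def within_level(x, image, n):
    """Check the inequality of (74) for a single point x and its image."""
    return circle_distance(x, image) < Fraction(1, 2 ** n)

# Points close to 0 and close to 1 are near each other on the circle.
print(circle_distance(Fraction(9, 10), Fraction(1, 10)))
print(within_level(Fraction(1, 100), Fraction(99, 100), 5))
```

A map that sends every subinterval of length $\ell$ into itself moves each point by less than $\ell$ in this metric, which is the mechanism behind assertion (iv).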

We postpone the proof of the proposition and first draw some consequences. Suppose
that $\alpha$ as well as $(T_\alpha^{(\tau_n)})_{n=0}^\infty$ have been defined and satisfy the assertions of Proposition 4.1.

Proposition 4.2. Fix $M \ge 2$ and define the cost function $c_M : [0,1) \times [0,1) \to [0,\infty]$ by

$$c_M(x,y) = \begin{cases} h_+(x,y), & \text{for } (x,y) \text{ in the graph of } T_\alpha^0, T_\alpha^1, T_\alpha^{(\tau_2)}, T_\alpha^{(\tau_3)}, \ldots, T_\alpha^{(\tau_M)}, \\ \infty, & \text{otherwise.} \end{cases}$$

For this cost function $c_M$ we find that the primal value, denoted by $P^M$, as well as the dual
value, denoted by $D^M$, of the Monge-Kantorovich problem both are equal to 1.
In addition, there is $\beta = \beta(M) > 0$, such that, for every partial transport

$$\sigma \in \Pi^{\rm part}(\mu,\nu) := \{\sigma \in \mathcal{M}(X \times Y) : p_X(\sigma) \le \mu,\ p_Y(\sigma) \le \nu\}$$

with 

||er|| > | and / Cm{x,v) da(x,y) < |, 

JXxY 

there is no partial transport $\rho \in \Pi^{\rm part}(\mu,\nu)$ with

$$\|\sigma + \rho\| = 1 \quad \text{and} \quad \sigma + \rho \in \Pi(\mu,\nu)$$

with the property that $\rho$ is supported by

$$\Delta^\beta = \{(x,y) \in [0,1)^2 : \delta(x,y) < \beta\}.$$

Proof. First note that there is an open and dense subset $G \subseteq [0,1)$ of full measure $\mu(G) = 1$
such that $c_M$, restricted to $G \times G$, is lower semi-continuous. This follows from assertion (i) of
Proposition 4.1 by replacing the half open intervals by their open interior. Noting that $G$ is
Polish we may apply the general duality theory [Kel84] to the cost function $c_M$ restricted to
$G \times G$ to conclude that there is no duality gap for the cost function $c_M|_{G \times G}$. It follows that
there is also no duality gap for the original setting of $c_M$, defined on $[0,1) \times [0,1)$, either.

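We claim that, for every $M > 0$, the value $D^M$ of the dual problem equals 1. Indeed, let
$(\varphi_n, \psi_n)_{n=1}^\infty$ be a sequence as in Proposition 4.1 (iii). Defining

$$\tilde\varphi_n(x) := \varphi_n(x) - \sum_{j=0}^{M} [\varphi_n(x) + \psi_n(T_\alpha^{(\tau_j)}(x)) - h(x, T_\alpha^{(\tau_j)}(x))]_+$$

and $\tilde\psi_n = \psi_n$, we have that

$$\tilde\varphi_n(x) + \tilde\psi_n(y) \le h(x,y) \le h_+(x,y),$$

for all $(x,y)$ in the graph of $T_\alpha^0, T_\alpha^1, T_\alpha^{(\tau_2)}, \ldots, T_\alpha^{(\tau_M)}$, and

$$\lim_{n\to\infty} \left[ \int_X \tilde\varphi_n(x)\, dx + \int_Y \tilde\psi_n(y)\, dy \right] = 1,$$

showing that $D^M \ge 1$. It follows that $D^M = P^M = 1$.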
Now suppose that the final assertion of the proposition is wrong to find a sequence
(0?i)5?Li € II part (^,^) with ||er„|| > | and J XxY c m(x, y) da n (x,y) < |, as well as a sequence 
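$(\rho_n)_{n=1}^\infty \in \Pi^{\rm part}(\mu,\nu)$ with $\|\sigma_n + \rho_n\| = 1$ and $\sigma_n + \rho_n \in \Pi(\mu,\nu)$ such that $\rho_n$ is supported
by

$$\Delta^{1/n} = \{(x,y) \in [0,1)^2 : \delta(x,y) < \tfrac1n\}. \tag{75}$$

Considering $(\sigma_n)_{n=1}^\infty$ as measures on the product $G \times G$ of the Polish space $G$, we then can
find by Prokhorov's theorem a subsequence $(\sigma_{n_k})_{k=1}^\infty$ converging weakly on $G \times G$ to some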
a- G Il part (^, ;/), for which we find \\a\\ > | and f Xx Y c ( x >y} ^O^f) — 5- By passing once 
more to a subsequence, we may also suppose that $(\rho_{n_k})_{k=1}^\infty$ weakly converges (as measures
on $G \times G$ or $[0,1) \times [0,1)$; here it does not matter) to some $\rho \in \Pi^{\rm part}(\mu,\nu)$ for which we get
$\|\sigma + \rho\| = 1$ and $\sigma + \rho \in \Pi(\mu,\nu)$. By (75) we conclude that $\rho$ induces the identity transport
from its marginal $p_X(\rho)$ onto its marginal $p_Y(\rho) = p_X(\rho)$. As $c_M(x,x) = 1$, for $x \in [0,1)$, we
find that J XxY c M{x,y) dg(x,y) = \\g\\ < |, which implies that 



/ 



c M (x,y) d(iT + g)(x,y) < \ + ± 



a contradiction to the fact that P M = 1 which finishes the proof. □ 

We now can proceed to the construction of the example. 

Proposition 4.3. Assume the setting of Proposition 4.1. For a subsequence $(i_j)_{j=2}^\infty$ of
$\{2, 3, \ldots\}$ we define the cost function $c : [0,1) \times [0,1) \to [0,\infty]$ by

$$c(x,y) = \begin{cases} h_+(x,y), & \text{for } (x,y) \text{ in the support of } T_\alpha^0, T_\alpha^1, T_\alpha^{(\tau_{i_2})}, T_\alpha^{(\tau_{i_3})}, \ldots, \\ \infty, & \text{otherwise.} \end{cases} \tag{76}$$

If $(i_j)_{j=2}^\infty$ tends sufficiently fast to infinity we have that, for this cost function $c$, the primal
value $P$ is strictly positive, while the relaxed primal value $P^{\rm rel}$ (see [BLS09, Example 4.3])
as well as the dual value $D$ and the relaxed dual value $D^{\rm rel}$ (see (70)) all are equal to 0.

In particular there is a duality gap $P - D^{\rm rel} > 0$, disproving the conjecture mentioned at
the beginning of this section.

Proof. We proceed inductively: let $j \ge 2$ and suppose that $i_0 = 0, i_1 = 1, i_2, \ldots, i_j$ have
been defined. Apply Proposition 4.2 to

$$c_j(x,y) = \begin{cases} h_+(x,y), & \text{for } (x,y) \text{ in the support of } T_\alpha^0, T_\alpha^1, T_\alpha^{(\tau_{i_2})}, T_\alpha^{(\tau_{i_3})}, \ldots, T_\alpha^{(\tau_{i_j})}, \\ \infty, & \text{otherwise,} \end{cases}$$

to find $\beta_j > 0$ satisfying the conclusion of Proposition 4.2. We may and do assume that
$\beta_j < \min(\beta_1, \ldots, \beta_{j-1})$. Now choose $i_{j+1}$ such that

$$\delta(x, T_\alpha^{(\tau_{i_{j+1}})}(x)) < \beta_j, \quad x \in [0,1). \tag{77}$$

This finishes the inductive step and well-defines the cost function $c(x,y)$ in (76).

By (71) each $T_\alpha^{(\tau_{i_j})}$ induces a Monge transport $\pi_{i_j} \in \Pi(\mu,\nu)$ which satisfies

$$\int_{X \times Y} h(x,y)\, d\pi_{i_j}(x,y) = \int_X h(x, T_\alpha^{(\tau_{i_j})}(x))\, dx = 1.$$

The fact that the relaxed primal value $P^{\rm rel}$ for the cost function $c$ equals zero, directly
follows from the definition of $P^{\rm rel}$ [BLS09, Section 1.1], (72) and (73) by transporting the
measure $\mu 1_{\{g_n = 0\}}$, which has mass $1 - \eta_n$, via the Monge transport map $T_\alpha^{(\tau_n)}$, where $n$ is a
large element of the sequence $(i_j)_{j=2}^\infty$. Hence we conclude from [BLS09, Theorem 1.2] that
the dual value $D$ of the Monge-Kantorovich problem for the cost function $c$ defined in (76)
also equals zero.
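The arithmetic behind this step can be made explicit in an idealized situation. The sketch below assumes, purely for illustration, that $f_n$ coincides exactly with the two-valued function $g_n$ of (73); in the paper the two only agree up to an $L^1$-error smaller than $\eta_n$, by (72). The function name `partial_transport_summary` is hypothetical.

```python
from fractions import Fraction

def partial_transport_summary(eta):
    """Idealized two-valued cost: 1/eta on a set of measure eta, 0 elsewhere.

    Returns (total integral, mass of the zero set, cost carried by the zero set).
    """
    total = (1 / eta) * eta + 0 * (1 - eta)   # integral over all of [0, 1)
    mass_zero_set = 1 - eta                    # mass that can be moved for free
    cost_on_zero_set = 0 * (1 - eta)           # cost of moving exactly that mass
    return total, mass_zero_set, cost_on_zero_set

for n in (1, 3, 10):
    print(partial_transport_summary(Fraction(1, 2 ** n)))
```

The full transport always has cost 1, while a partial transport of mass $1 - \eta_n$ costs nothing in the idealized model; as $\eta_n \to 0$ the transported mass tends to one, which is the sense in which the whole cost concentrates on a vanishing set.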

Finally observe that we have $D = D^{\rm rel}$ in the present example: indeed, the set $\{(x,y) \in
[0,1)^2 : c(x,y) < \infty\}$ is the countable union of the supports of the finite cost Monge transport
plans $T_\alpha^0, T_\alpha^1, T_\alpha^{(\tau_{i_2})}, T_\alpha^{(\tau_{i_3})}, \ldots$, so that the requirements $\varphi(x) + \psi(y) \le c(x,y)$,
for all $(x,y) \in [0,1)^2$, and $\varphi(x) + \psi(y) \le c(x,y)$, $\pi$-a.s., for each finite transport plan
$\pi \in \Pi(\mu,\nu)$, coincide (after possibly modifying $\varphi(x)$ on a $\mu$-null set).

What remains to prove is that the primal value P satisfies P > 0. We shall show that, 
for every transport plan ir G H(fj,,v), we have J Xx yc(x,y) dTr(x,y) > |. Assume to the 
contrary that there is it g II(/z, z/) such that 

J c(x, y) dTr(x,y) < \. 

XxY 

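Denoting by $\sigma_j$ the restriction of $\pi$ to the union of the graphs of the maps $T_\alpha^0, T_\alpha^{(\tau_{i_1})},
T_\alpha^{(\tau_{i_2})}, \ldots, T_\alpha^{(\tau_{i_j})}$, each $\sigma_j$ is a partial transport in $\Pi^{\rm part}(\mu,\nu)$ and the norms $(\|\sigma_j\|)_{j=1}^\infty$
increase to one. Choose $j$ such that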

11^ II \ 2 

II°jII > g- 

We apply Proposition 4.2 to conclude that there is no partial transport plan $\rho_j$ such that
$\sigma_j + \rho_j \in \Pi(\mu,\nu)$, and such that $\rho_j$ is supported by $\Delta^{\beta_j}$. But this is a contradiction as
$\rho_j = \pi - \sigma_j$ has precisely these properties by (77). □

Proof of Proposition 4.1: The construction of the example described by Proposition 4.1 will
be an extension of the construction in the previous section from which we freely use the
notation.

We shall proceed by induction on $j \in \mathbb{N}$ and define a double-indexed family of maps
$\tau_{n,j} : [0,1) \to \mathbb{Z}$, where $1 \le n \le j$.
Step $j = 1$: Define

$$\tau_{1,1} : [0,1) \to \mathbb{Z}$$

as

$$\tau_{1,1} = -\tau_1,$$

where we have $m_1 = M_1$, $\alpha_1 = \frac{1}{M_1}$ and $\tau_1$ as in (28) above. At this stage the only difference
to the previous section is that we change the sign of $\tau_1$ as we now shall build up a "positive
singular mass", as opposed to the "negative singular mass" which we constructed in the
previous section. More precisely, defining $\varphi^1, \psi^1$ as in (27), we obtain, similarly as in (29)

$$\varphi^1(x) + \psi^1(T_{\alpha_1}^{(\tau_{1,1})}(x)) = \begin{cases} 0, & \text{for } x \in I_{k_1},\ k_1 \in \{2, \ldots, (M_1-1)/2, (M_1+3)/2, \ldots, M_1 - 1\}, \\ (M_1-1)/2, & \text{for } x \in I_{k_1},\ k_1 = 1, M_1, \\ 1, & \text{for } x \in I_{(M_1+1)/2}. \end{cases}$$

This finishes the inductive step for $j = 1$.

Step $j = 2$: Let $m_2$ and $M_2 = M_1 m_2$ be as in section 3, where $m_2$ satisfies the
requirements of Lemma 3.1, and still is free to be eventually specified. To define $\tau_{1,2} :
[0,1) \to \mathbb{Z}$ we want to make sure that the map $T_{\alpha_2}^{(\tau_{1,2})}$ maps the intervals $I_{k_1}$ bijectively onto
$T_{\alpha_1}^{(\tau_{1,1})}(I_{k_1})$. Using the notation of the previous section, we consider all the intervals $I_{k_1}$ as
"good" intervals so that we do not have to take extra care of some "singular" intervals.

More precisely, fix $1 \le k_1 \le M_1$, and write $\tau$ for $\tau_{1,1}|_{I_{k_1}}$. If $\tau > 0$, define $J^{k_1,c}$ as
$\{m_2 - \tau + 1, \ldots, m_2\}$, i.e. the set of those indices $k_2$ such that the interval $I_{k_1,k_2}$ is not
mapped into $T_{\alpha_1}^{(\tau_{1,1})}(I_{k_1})$ under $T_{\alpha_2}^{\tau}$. If $\tau < 0$, we define $J^{k_1,c}$ as $\{1, \ldots, |\tau|\}$, and if $\tau = 0$,
we define $J^{k_1,c}$ as the empty set. The complement $\{1, \ldots, m_2\} \setminus J^{k_1,c}$ is denoted by $J^{k_1,u}$.

Define $\tau_{1,2} := \tau_{1,1} = \tau$ on the intervals $I_{k_1,k_2}$, for $k_2 \in J^{k_1,u}$. On the remaining intervals
$I_{k_1,k_2}$ with $k_2 \in J^{k_1,c}$ we define $\tau_{1,2}$ such that it takes constant values in $\{-M_2 + 1, \ldots, M_2 - 1\}$
on each of these intervals, such that (37) (resp. (38)) is satisfied, and such that these
intervals $I_{k_1,k_2}$ are mapped onto the "remaining gaps" in $T_{\alpha_1}^{(\tau_{1,1})}(I_{k_1})$.

Using again Lemma 3.3 we summarize the properties of the thus constructed map $T_{\alpha_2}^{(\tau_{1,2})} :
[0,1) \to [0,1)$.

(i) The measure-preserving bijection $T_{\alpha_2}^{(\tau_{1,2})}$ maps each interval $I_{k_1}$ onto $T_{\alpha_1}^{(\tau_{1,1})}(I_{k_1})$. It
induces a permutation of the intervals $I_{k_1,k_2}$, where $1 \le k_1 \le M_1$, $1 \le k_2 \le m_2$.

(ii) Defining ip 2 , ip 2 as in (32) we get, for each 1 < fci < M 1; similarly as in (39) and (40) 



Mtffein {71,2^71,!}] < f>[4J, 



as well as 

Mi 



E / i(^ 1 w-^(^ i) (x))-(^( a; )-^(T(7^( a; ))Mx< 
fe 1= i 

(iii) On the middle interval $I^1_{\rm middle} = I_{(M_1+1)/2}$ we have $\tau_{1,2} = \tau_{1,1} = 0$.



4Mf 
m 2 



We now pass to the construction of the map $\tau_{2,2} : [0,1) \to \mathbb{Z}$. We define, for each
$1 \le k_1 \le M_1$, and $x \in I_{k_1,k_2}$,

$$\tau_{2,2}(x) = \begin{cases} a_2(k_2), & \text{for } k_2 \in \{1, \ldots, M_1\}, \\ -M_1, & \text{for } k_2 \in \{M_1 + 1, \ldots, (m_2-1)/2\}, \\ 0, & \text{for } k_2 = (m_2+1)/2, \\ M_1, & \text{for } k_2 \in \{(m_2+3)/2, \ldots, m_2 - M_1\}, \\ a_2(k_2), & \text{for } k_2 \in \{m_2 - M_1 + 1, \ldots, m_2\}. \end{cases}$$

The definition of the function $a_2$ on the "singular" intervals $I_{k_1,k_2}$, where $k_2 \in \{1, \ldots, M_1\} \cup
\{m_2 - M_1 + 1, \ldots, m_2\}$, is done such that $T_{\alpha_2}^{(\tau_{2,2})}$ maps these intervals onto "remaining gaps"
$I_{k_1,l_2}$, where $l_2$ runs through the set

$$\{(m_2-1)/2 - M_1 + 1, \ldots, (m_2-1)/2\} \cup \{(m_2+3)/2, \ldots, (m_2+3)/2 + M_1 - 1\}$$

in the middle region of the interval $I_{k_1}$. As above we require in addition that $a_2$ on each
$I_{k_1,k_2}$ takes constant values in $\{-M_2 + 1, \ldots, M_2 - 1\}$ and that (37) (resp. (38)) is satisfied.

The function $\tau_{2,2}$ mimics the construction of $\tau_{1,1}$ above, with the role of $[0,1)$ replaced
by each of the intervals $I_{k_1}$, for $1 \le k_1 \le M_1$. The idea is that, $T_{\alpha_1}^{M_1}$ being the identity map,
we have that $T_{\alpha_2}^{M_1}$ satisfies $T_{\alpha_2}^{M_1}(x) = x \oplus \frac{M_1}{M_2}$ and $\frac{M_1}{M_2} = \frac{1}{m_2}$ is small. Hence the role of $T_{\alpha_1}$
in the previous section now is taken by $T_{\alpha_2}^{M_1}$.

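More precisely, we have, for each $k_1 = 1, \ldots, M_1$, and $x \in I_{k_1,k_2}$

$$\varphi^2(x) + \psi^2(T_{\alpha_2}^{(\tau_{2,2})}(x)) = \begin{cases} 0, & \text{for } k_2 \in \{M_1 + 1, \ldots, (m_2-1)/2, (m_2+3)/2, \ldots, m_2 - M_1\}, \\ \frac{m_2}{2M_1} + \gamma(M_1), & \text{for } k_2 \in \{1, \ldots, M_1\} \cup \{m_2 - M_1 + 1, \ldots, m_2\}, \\ 1, & \text{for } k_2 = (m_2+1)/2. \end{cases} \tag{78}$$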
The notation $\gamma(M_1)$ denotes a quantity verifying $|\gamma(M_1)| \le c(M_1)$ for some constant $c(M_1)$,
depending only on $M_1$. The verification of (78) uses Lemma 3.3 and is analogous to that in
section 3.

As $T_{\alpha_2}^{(\tau_{2,2})}$ defines a measure preserving bijection on $[0,1)$, we get

$$\int_0^1 \big( \varphi^2(x) + \psi^2(T_{\alpha_2}^{(\tau_{2,2})}(x)) \big)\, dx = \int_0^1 (\varphi^2(x) + \psi^2(x))\, dx = 1. \tag{79}$$

This finishes the inductive step for $j = 2$.
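The identity (79) uses only that the map is a measure preserving bijection: composing $\psi^2$ with such a map does not change its integral. A finite analogue, in which a permutation of finitely many points of equal mass plays the role of the bijection, can be sketched as follows; the function name `shifted_sum` and the sample data are illustrative assumptions.

```python
def shifted_sum(phi, psi, perm):
    """Sum over x of phi[x] + psi[perm[x]] for a permutation perm of the indices."""
    n = len(phi)
    # A permutation is the finite counterpart of a measure preserving bijection.
    assert sorted(perm) == list(range(n)), "perm must be a permutation"
    return sum(phi[x] + psi[perm[x]] for x in range(n))

phi = [3, 0, 1]
psi = [-2, 1, 0]
perm = [2, 0, 1]
print(shifted_sum(phi, psi, perm), sum(phi) + sum(psi))
```

Both printed numbers agree for every permutation, although the pointwise values `phi[x] + psi[perm[x]]` may differ greatly from `phi[x] + psi[x]`; this is how large values on small "singular" intervals can coexist with a total integral equal to one.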

General inductive step: For prime numbers $m_1, \ldots, m_{j-1}$ as in the previous section
suppose that we have defined, for $1 \le n \le j-1$, maps $\tau_{n,j-1} : [0,1) \to \mathbb{Z}$ such that the following
inductive hypotheses are satisfied.

(i) For $1 \le n \le j-1$, the measure preserving bijection $T_{\alpha_{j-1}}^{(\tau_{n,j-1})} : [0,1) \to [0,1)$ maps
the intervals $I_{k_1,\ldots,k_{n-1}}$ onto themselves. It induces a permutation of the intervals
$I_{k_1,\ldots,k_{j-1}}$, where $1 \le k_1 \le m_1, \ldots, 1 \le k_{j-1} \le m_{j-1}$.

(ii) For $1 \le n \le j-1$ we have, for $1 \le k_1 \le m_1, \ldots, 1 \le k_{j-2} \le m_{j-2}$,

v\.Ik lt . ..,kj- 2 n {r„ ;j _ 2 ^ r nJ _i}] < ^/i[/ fcl ,...,fc,_ 2 ], (80) 

and 

E / |(^"- 2 (x)-^- 2 (r(;--)(x))) 

l<fei<mi,...,K(; j _2<m j _2 7fc i ,---.%-2 

-(^'- 1 w-^'- 1 (T&r i) (»))) 



(81) 



e(Mi,...,Mj- 2 ) 



We now shall define $\tau_{n,j} : [0,1) \to \mathbb{Z}$, for $1 \le n < j$, and $\tau_{j,j} : [0,1) \to \mathbb{Z}$.
Fix $1 \le n \le j-1$ as well as $1 \le k_1 \le m_1, \ldots, 1 \le k_{j-1} \le m_{j-1}$. Denote by $\tau$
the constant value $\tau_{n,j-1}|_{I_{k_1,\ldots,k_{j-1}}}$. If $\tau > 0$ define $J^{k_1,\ldots,k_{j-1},c}$ as $\{m_j - \tau + 1, \ldots, m_j\}$,
similarly as for the case $j = 2$ above. If $\tau \le 0$ define $J^{k_1,\ldots,k_{j-1},c}$ as $\{1, \ldots, |\tau|\}$ which, for
$\tau = 0$, equals the empty set. On the intervals $I_{k_1,\ldots,k_{j-1},k_j}$ where $k_j$ lies in the complement
$J^{k_1,\ldots,k_{j-1},u} = \{1, \ldots, m_j\} \setminus J^{k_1,\ldots,k_{j-1},c}$ we define $\tau_{n,j} := \tau_{n,j-1}$. On the remaining intervals
$I_{k_1,\ldots,k_{j-1},k_j}$, where $k_j \in J^{k_1,\ldots,k_{j-1},c}$, we define $\tau_{n,j}$ in such a way that it takes constant
values in $\{-M_j + 1, \ldots, M_j - 1\}$ on each of these intervals, such that (37) (resp. (38)) is
satisfied, and such that these intervals $I_{k_1,\ldots,k_{j-1},k_j}$ are mapped onto the "remaining gaps"
in $T_{\alpha_{j-1}}^{(\tau_{n,j-1})}(I_{k_1,\ldots,k_{j-1}})$.

Similarly as in the previous section we thus well-define the function $\tau_{n,j}$ which then
verifies (80) and (81), with $j-1$ replaced by $j$.

We still have to define $\tau_{j,j} : [0,1) \to \mathbb{Z}$. For $1 \le k_1 \le m_1, \ldots, 1 \le k_{j-1} \le m_{j-1}$, we
define $\tau_{j,j}(x)$ on the intervals $I_{k_1,\ldots,k_{j-1},k_j}$ by

$$\tau_{j,j}(x) = \begin{cases} a_j(k_j), & \text{for } k_j \in \{1, \ldots, M_{j-1}\}, \\ -M_{j-1}, & \text{for } k_j \in \{M_{j-1} + 1, \ldots, (m_j-1)/2\}, \\ 0, & \text{for } k_j = (m_j+1)/2, \\ M_{j-1}, & \text{for } k_j \in \{(m_j+3)/2, \ldots, m_j - M_{j-1}\}, \\ a_j(k_j), & \text{for } k_j \in \{m_j - M_{j-1} + 1, \ldots, m_j\}. \end{cases}$$

Similarly as in step $j = 2$ the $\{-M_j + 1, \ldots, M_j - 1\}$-valued function $a_j(k_j)$ is defined
in such a way that $T_{\alpha_j}^{(\tau_{j,j})}$ maps the intervals $I_{k_1,\ldots,k_{j-1},k_j}$ with $k_j \in \{1, \ldots, M_{j-1}\} \cup \{m_j -
M_{j-1} + 1, \ldots, m_j\}$ to the intervals $I_{k_1,\ldots,k_{j-1},l_j}$, where $l_j$ runs through the "middle region"

$$\{(m_j-1)/2 - M_{j-1} + 1, \ldots, (m_j-1)/2\} \cup \{(m_j+3)/2, \ldots, (m_j+3)/2 + M_{j-1} - 1\}.$$

We now deduce from Lemma 3.3 that, for $x \in I_{k_1,\ldots,k_{j-1},k_j}$

$$\varphi^j(x) + \psi^j(T_{\alpha_j}^{(\tau_{j,j})}(x)) = \begin{cases} 0, & \text{for } k_j \in \{M_{j-1} + 1, \ldots, (m_j-1)/2\} \cup \{(m_j+3)/2, \ldots, m_j - M_{j-1}\}, \\ \frac{m_j}{2M_{j-1}} + \gamma(M_1, \ldots, M_{j-1}), & \text{for } k_j \in \{1, \ldots, M_{j-1}\} \cup \{m_j - M_{j-1} + 1, \ldots, m_j\}, \\ 1, & \text{for } k_j = (m_j+1)/2, \end{cases}$$

where $\gamma(M_1, \ldots, M_{j-1})$ denotes a quantity which is bounded in absolute value by a constant
$c(M_1, \ldots, M_{j-1})$ depending only on $M_1, \ldots, M_{j-1}$.
This completes the inductive step.

We now define $\tau_0 = 0$, $\tau_1 = 1$ and, for $n \ge 2$

$$\tau_n = \lim_{j\to\infty} \tau_{n-1,j}. \tag{82}$$

It follows from (80) that, for each $n \ge 2$, the limit (82) exists almost surely provided the
sequence $(m_n)_{n=1}^\infty$ converges sufficiently fast to infinity, similarly as in section 3 above. The
$(\tau_n)_{n=0}^\infty$ and the above constructed functions $(\varphi^n, \psi^n)_{n=1}^\infty$ satisfy the assertions of Proposition
4.1. The verification of items (i), (ii), and (iii) is analogous to the arguments of section 3 and
therefore skipped. As regards assertion (iv) note that, for $1 \le n \le j$ the function $T_{\alpha_j}^{(\tau_{n,j})}$
maps the intervals $I_{k_1,\ldots,k_{n-1}}$ onto themselves. It follows that $T_\alpha^{(\tau_n)}$ does so too, whence

$$\delta(x, T_\alpha^{(\tau_n)}(x)) \le M_{n-1}^{-1},$$

which readily shows (74). □